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Abstract 

We define the notion of a transitive-closure spanner of a directed graph. Given a directed graph 
G = {V,E) and an integer > 1, a k-tmnsitive-closure-spanner (k-TC-spanner) of G is a directed 
graph H = (V, Eh) that has (1) the same transitive-closure as G and (2) diameter at most k. These 
spanners were studied implicitly in access control, property testing, and data structures, and properties 
of these spanners have been rediscovered over the span of 20 years. We bring these areas under the 
unifying framework of TC-spanners. We abstract the common task implicitly tackled in these diverse 
applications as the problem of constructing sparse TC-spanners. 

We study the approximability of the size of the sparsest fc-TC-spanner for a given digraph. Our 
technical contributions fall into three categories: algorithms for general digraphs, inapproximability 
results, and structural bounds for a specific graph family which imply an efficient algorithm with a good 
approximation ratio for that family. 

Algorithms. We present two efficient deterministic algorithms that find fc-TC- spanners of size ap- 
proximating the optimum. The first algorithm gives an 0(n^^^/'^)-approximation for fc > 2. Our 
method, based on a combination of convex programming and sampling, yields the first sublinear approx- 
imation ratios for (1) Directed fc-SPANNER, a well-studied generalization of Ai-TC-Spanner, and (2) 
its variants Client/Server Directed fc-SPANNER, and the fc-DiAMETER Spanning Subgraph. 
This resolves the main open question of Elkin and Peleg (IPCO, 2001). The second algorithm, specific 
to the fc-TC-spanner problem, gives an 0(n/fc^)-approximation. It shows that for k = Q.{y/n), our 
problem has a provably better approximation ratio than DIRECTED fc-SPANNER and its variants. This 
algorithm also resolves an open question of Hesse (SODA, 2003). 

Inapproximability. Our main technical contribution is a pair of strong inapproximability results. We 
resolve the approximability of 2-TC-spanners, showing that it is 6 (log n) unless P = NP. For constant 

fc > 3, we prove that the size of the sparsest fc-TC-spanner is hard to approximate within 2'°^ for 
any e > 0, unless NP C DTIME(nP°'^'°s ") . Our hardness result helps explain the difficulty in designing 
general efficient solutions for the applications above, and it cannot be improved without resolving a long- 
standing open question in complexity theory. It uses an involved application of generalized butterfly and 
broom graphs, as well as noise-resilient transformations of hard problems, which may be of independent 
interest. 

Structural bounds. Finally, we study the size of the sparsest TC-spanner for iJ-minor-free digraphs, 
which include planar, bounded genus, and bounded tree-width graphs, explicitly investigated in applica- 
tions above. We show that every if-minor-free digraph has an efficiently constructable fc-TC-spanner of 
size 0{n), which implies an (5(l)-approximation algorithm for this family. Furthermore, using our in- 
sight that 2-TC-spanners yield property testers, we obtain a monotonicity tester with 0(log^ n/e) queries 
for any poset whose transitive reduction is an if-minor free digraph. This improves and generalizes the 
previous 9(\/nlogn/e)-query tester of Fischer etal (STOC, 2002). 
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1 Introduction 



A spanner can be thought of as a sparse backbone of a graph that approximately preserves distances between 
every pair of vertices. More precisely, a subgraph H = {V, Eh) is a k-spanner of G = (F, E) if for every 
pair of vertices u, v G V, the shortest path distance dniu, v) from u to t; in is at most k • dciu, v). 
Since they were introduced by Peleg and Schaffer [40] in the context of distributed computing, spanners for 
undirected graphs have been extensively studied. The tradeoff between the parameter k, called the stretch, 
and the number of edges in a spanner is relatively well understood: for every k > 1, any undirected graph 
on n vertices has a (2k — l)-spanner with ©(n^+^Z'^) edges [6, 39, 52]. This is known to be tight for 
k = 1, 2, 3, 5 and is conjectured to be tight for all k (see, for example a survey by Zwick [55]). Undirected 
spanners have numerous applications, such as efficient routing [16, 17, 42, 44, 51], simulating synchronized 
protocols in unsynchronized networks [41], parallel and distributed algorithms for approximating shortest 
paths [14, 15, 20], and algorithms for distance oracles [9, 52]. 

In the directed setting, two notions of spanners have been considered in the literature: the direct gen- 
eralization of the above definition [40] and roundtrip spanners [17, 44]. In this paper, we introduce a new 
definition of directed spanners that captures the notion that a spanner should have a small diameter but 
preserve the connectivity of the original graph. 

Definition 1.1 (TC-spanner). Given a directed graph G = {V, E) and an integer k > 1, a k-transUive- 
closure-spanner (k-TC-spanner) is a directed graph H = {V, Eh) with the following properties: 

1. Eh is a subset of the edges in the transitive closure ofG. 

2. For all vertices u,v eV, ifdciu, v) < oo, then dniu, v) < k. 

Notice that a /c-TC-spanner of G is just a directed spanner of the transitive-closure of G with stretch k. 
Nevertheless, a A;-TC-spanner is interesting in its own right due to the numerous TC-spanner-specific appli- 
cations we present in Section 1.3. 

One of the focuses of this paper is the study of the computational problem of finding the size of the 
sparsest /c-TC-spanner for a given digraph, referred to as fe-TC-SPANNER. It is a special case of the problem 
of finding the size of the sparsest directed spanner, called Directed /c-Spanner, that has been previously 
studied. Both problems are NP-hard (see Appendix F). 

1.1 Related Work 

Thorup [47] considered a special case of TC-spanners of graphs G that have at most twice as many edges 
as G, and conjectured that for all directed graphs G with n vertices there are such TC-spanners with stretch 
polylogarithmic in n. He proved his conjecture for planar graphs [48], but later Hesse [33] gave a coun- 
terexample to Thorup's conjecture for general graphs. TC-spanners were also studied for directed trees: 
implicitly in [5, 8, 12, 18, 54] and explicitly in [49]. For the directed line, [5] (and later, [8]) showed that the 
size of the sparsest A;-TC-spanner is 6(n • Afe(n)), where Afe(n) is the k^^-rov/ inverse Ackermann function. 
[5, 12, 49] gave the same bounds for rooted directed trees. 

Approximability of directed spanner problems. All algorithms for Directed A;-Spanner immedi- 
ately yield algorithms for A;-TC-Spanner with the same approximation ratio. Kortsarz and Peleg [37] 
give an O (log n) -approximation algorithm for DiRECTED-2- Spanner, and Kortsarz [35] shows that this 
approximation ratio cannot be improved unless P=NP. For A; = 3, Elkin and Peleg [21] present an 0(n^/^)- 
approximation algorithm. Their algorithm is comphcated, and the polylog factor hidden in the O notation is 
not analyzed. For k > A, sublinear factor approximation algorithms are known only in the undirected setting 
[40]. We note that Dodis and Khanna [19] and Chekuri et al. [13] study algorithms that might seem relevant 
to A;-TC-Spanner. In Appendix A.l we explain why these algorithms do not work for A;-TC-Spanner. 
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Table 1: Summary of Results on Approximability of A;-TC-Spanner 



For any constant k > 2 and < e < 1, it is hard to approximate DIRECTED fc-SPANNER within a factor 
of 2'°e'-' assuming NPgDTIME(n^'°'2/iogn^ ^1]. Moreover, [24] extend this result to 3 < A; = 0{-n}-^) 
for any < (5 < 1. Thus, according to Arora and Lund's classification [34] of NP-hard problems. Directed 
/c-Spanner is in class 111, for 3 < /c = 0{n^~^). Moreover, [24] show that proving that Directed k- 
Spanner is in class IV, that is, inapproximable within for some < 5 < 1, would resolve a long 
standing open question in complexity theory, and cause classes IH and IV to collapse into a single class. 

1.2 Our Contributions 

The contributions of this paper are the following: (1) we bring several diverse applications, including prop- 
erty testing, access control and data structures, under the unifying framework of TC-spanners, (2) we obtain 
strong bounds on the approximability of A;-TC-Spanner and Directed A;-Spanner as well as some well- 
studied variants of these problems, and (3) we characterize the exact size of TC-spanners and obtain better 
bounds for the family of i?-minor free graphs, which include planar, bounded-treewidth, and bounded genus 
graphs. Our results on the approximability of fc-TC- SPANNER are summarized in Table 1. 

Algorithms for /c-TC-Spanner and Related Problems. We present two deterministic polynomial time 
approximation algorithms for A;-TC- SPANNER. Our first algorithm uses a new combination of convex 
programming and sampling, and gives an 0((n log n)-'^~^/'^)-ratio for A;-TC-Spanner. Moreover, our 
method yields the same approximation ratio for DIRECTED /c-Spanner and its well-studied variants: 
Client/Server Directed /c-Spanner, and ^-Diameter Spanning Subgraph (see [22] for defi- 
nitions). This resolves the open question of finding a subUnear approximation ratio for these problems for 
k > ?>, described as a "challenging direction" for research on directed spanners by Elkin and Peleg [23]. 
Our algorithm for A; = 3 is arguably simpler than the 0(n^/^ polylog n) -approximation algorithm of [23]. 

Our second algorithm has an 0(n/fc^) ratio for fc-TC-SPANNER. This demonstrates a separation be- 
tween A;-TC-Spanner and Directed A;-Spanner: for k = ^/n, it gives O (log n) -approximation for A;- 
TC- Spanner while [24, Theorem 6.6] showed that Directed y^- Spanner is 2'°s^"' "-inapproximable. 
Moreover, Hesse [33] asks for an algorithm to add 0(|G|) "shortcuts" to a digraph and reduce its diameter 
to ^/n. Our second algorithm returns a y^-TC-spanner of size OdGj + logn), answering his question. 

Inapproximability of /c-TC-Spanner. We present two results on the hardness of /c-TC-Spanner. First, 
we prove for = 2 that the O(logn) ratio of [37] is optimal unless P=NP. Next, we show that for 
constant A; > 2, A;-TC-Spanner is inapproximable within a factor of 2'°^ for any e > 0, unless 
NPCDTIME(nP°'y^°s"). This result is our main technical contribution. Observe that a stronger inapprox- 
imability result for A; > 2 would imply the same inaproximability for DIRECTED- A;-Spanner, and as shown 
in [24], collapse classes III and IV in Arora and Lund's classification. 

Our 2'°s ^"-hardness matches the known hardness for Directed A;- Spanner. As is the case for 
Directed A;-Spanner, we start by building a directed graph from a well-known hard problem called 
MIN-REP, which has the same inapproximability as Symmetric Label Cover. However, as illus- 
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trated in Section 3, all known hard instances for DIRECTED A;-Spanner cannot imply anything better than 
r2(l)-hardness for /c-TC-Spanner. Intuitively, our lower bound is much harder to prove than the one for 
Directed A;-Spanner since our instance must be transitively-closed, and thus, many more "shortcut" 
routes between pairs of vertices exist. Our construction uses a novel application of the generalized butterfly 
and broom graphs, together with several transformations of the MIN-REP problem, which make it noise- 
resilient. We call a MIN-REP instance noise-resilient to indicate that its structure is preserved under small 
perturbations. The paths in the generalized butterfly are well-structured, which allows us to analyze the many 
different routes possible in the transitive closure. To realize these ideas, we perform various transformations 
on a MIN-REP instance to coordinate it with multiple copies of butterflies and brooms. 

Structural Results. Finally, we study the minimum fc-TC-spanner size for a specific graph family with 

sparse fe-TC-spanners: iJ-minor-free graphs. A graph H is a. minor of G if -ff is a subgraph of a graph 
obtained from G by a sequence of edge contractions and deletions. For a fixed graph H (e.g., K^), the 
family of H -minor-free graphs is a minor-closed family that excludes H. Examples of such families include 
planar graphs, bounded treewidth graphs, and bounded genus graphs, explicitly studied in applications in 
Section 1.3. For i7-minor-free graphs, we efficiently construct 2-TC-spanners of size 0(n log^ n), and A;- 
TC-spanners of size 0{n • logn • Afc(n)), where Afc(-) is the /c-row inverse Ackermann function. The main 
idea is to use the path separators for undirected H-rmaov free graphs due to Abraham and Gavoille [1]. 
However, although the separators are paths, in our digraph they may be the union of many dipaths, and so 
we cannot efficiently recurse using the sparse A;-TC-spamiers for the directed line of Alon and Schieber [5]. 
We observe that these separators satisfy a stronger property than claimed in [1], effectively allowing us to 
encode the direction of edges in a cost function associated with the separators. 

1.3 Applications of TC-spanners 

Monotonicity Testing. Monotonicity of functions [4, 10, 18, 25, 27, 28, 30, 32] is one of the most studied 
properties in property testing [31, 45]. Fischer et al. [28] prove that testing monotonicity is equivalent to 
several other testing problems. Let \4i be a poset of n elements and G„ = {Vn, E) be the relation graph, i.e., 
the Hasse diagram, for V^. A function / : 14 — *• M is called monotone if f{x) < f{y) for all {x, y) G E. 
We say / is e-far from monotone if / has to be changed on > e fraction of the domain to become monotone, 
that is, minmonotone g \{x ■ f{x) / g{x)}\ > <^n. A monotonicity tester on G„ is an algorithm that, given an 
oracle for a function / : y„ ^ M, passes if / is monotone but fails with probability > | if / is e-far from 
monotone. The optimal monotonicity tester for the directed line L„, consisting of vertices {1, 2, . . . , n} 
and edges + 1) : 1 < f < n — 1}, proposed by Dodis et al. [18], is based on the sparsest 2-TC- 
spanner for that graph. Implicit in the proof of Proposition 9 in [18] is a lemma relating the complexity 
of a monotonicity tester for L„ to the size of a 2-TC-spanner for L„. We generalize this by observing that 
a sparse 2-TC-spanner for any partial order graph G„ implies an efficient monotonicity tester on G„. In 
Appendix A.2, we prove the following lemma. 

Lemma 1.1. If a directed acyclic graph Gn has a 2-TC-spanner with s{n) edges, then there exists a mono- 
tonicity tester on Gn that runs in time O ( ) . 



Therefore, all the 2-TC-spanner constructions described in this paper yield monotonicity testers for functions 
defined on the corresponding posets. Moreover, for iJ-minor free graphs, the resulting tester has much 
better query complexity than the previously known, due to Fischer et al. [28]. Indeed, we achieve testers 
with 0(log^ n/e) queries, whereas previous testers required &{y/n/e) queries. 

Key Management in an Access Hierarchy. In the problem of key management in an access hierarchy, 
i.e., access control, there is a partially ordered set (poset) of access classes and a key associated with each 
class. This is modeled by a directed graph G whose nodes are classes and whose edges indicate an ordering. 
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A user is entitled to access a certain class and all classes reachable from it. This problem arises in content 
distribution, operating systems, and project development (see, e.g., the references in [8]). One approach to 
the access control problem [7, 8, 46] is to associate public information P{i, j) with each edge {i, j) G G and 
a secret key ki with each node i. There is an efficient algorithm A which takes ki and P{i, j) and generates 
kj. However, for each (i, j) in G, it is computationally hard to generate kj without knowledge of ki. To 
obtain a key k^ from a key ku, algorithm A is run dciu, v) times. To speed this up, [8] suggest adding edges 
to G to increase connectivity. To preserve the access hierarchy of G, new edges must be from the transitive 
closure of G. The number of edges added corresponds to the space complexity of the scheme, while the 
shortest-path distances correspond to the time complexity. Implicit in [8] are TC-spanners for directed trees 
with k = 3 and size 0{n log log n) and also with k = 0(log log n) and size 0(n). Our results for //-minor 
free graphs extend the known posets for which access control schemes have O (n poly log n) storage and 
0(1) key derivation time. Our approximation algorithms yield sparser fe-TC-spanners for general posets. 

Partial Products in a Semigroup. Yao [54] and Alon and Schieber [5] study space-efficient data struc- 
tures for the following problem: Preprocess elements {si, . . . , Sn} of a semigroup {S, o), such as (M, min), 
to be able to compute partial products Sj o Si+i o • • • o for all 1 < i < j < n with at most k queries to a 
small database of pre-computed partial products. This problem reduces to finding a sparsest fc-TC-spanner 
for a directed line Ln+i- Chazelle [12] and Alon and Schieber also consider a generalization of the above 
problem, where the input is an (undirected) tree T with an element Sj of a semigroup associated with each 
vertex i. The goal is to create a space-efficient data structure that allows one to compute the product of 
elements associated with all vertices on the path from i to j, for all vertex pairs i,j in T. The generalized 
problem reduces to finding a sparsest fe-TC-spanner for a certain directed tree T' obtained from T. We 
describe the reduction in Appendix A. 3. The same reduction to A;-TC-spanners can be used to design space- 
efficient data structures for any digraph with a unique path between pairs of nodes. Our structural results 
imply new space-efficient data structures for //-minor free graphs with unique paths, and our approximation 
algorithms yield more space-efficient data structures for general digraphs with unique paths. 

Organization. Section 2 contains an overview of our algorithms. In Section 3, we give an overview of 
our lower bounds and the techniques involved. Section 4 contains an overview of our bounds for minor-free 
graphs. We defer the details and proofs of our results to the Appendix. In Appendix A, we discuss the previ- 
ous work and applications to monotonicity testing and partial products in a semigroup. Appendix B contains 
our algorithms. In Appendix C, we give our 2^°^ "-inapproximability for k > 2, and in Appendix D we 
give our r2(log n)-inapproximability for A; = 2. In Appendix E, we give the proofs of our structural results. 

Notation. The transitive closure of a graph G = {V,E), denoted TC{G), is defined as the directed graph 

{V,E'), where E' = {{u,v) : u v}. Vertices u and v are comparable if either (u, v) G TC{G) or 
(f , u) G TC{G). The transitive reduction of G, denoted TR{G), is a digraph G' with the fewest edges for 
which TC{G') = TC{G). As shown by Aho et al [3], TR{G) can be computed efficiently via a greedy 
algorithm. For directed acyclic graphs TR{G) is unique, and G is transitively reduced if TR{G) = G. 

For fc > 1, we define: Sk{G) = min//{|//| : // is a A;-TC-spanner of G}, and we call H that achieves 
this minimum a sparsest k-TC-spanner. Clearly, Sk{G) > \TR{G)\. The Ackermann function [2] is defined 
by: A{l,j) = 2^ ^(i + 1,0) = A{i,l), A{i + l,j + 1) = A{i,2^^^'^^''^). The inverse Ackermann function 
is a{n) = min{i : A{i, 1) > n} and the i^^-rov/ inverse is \i{n) = m.m{j : A{i, j) > n}. 

2 Overview of Algorithms for A; -TC- Spanner and Related Problems 

Our 0((n log n)^~^/'^)-approximation for fc-TC-SPANNER for arbitrary k is based on anew combination of 
convex programming and samphng. The technique also achieves an ©((nlogn)^"^/*^) ratio for DIRECTED 
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A:-Spanner, Client/Server Directed /c-Spanner, and /c-Diameter Spanning Subgraph. Here 
we describe the result for DIRECTED /c-Spanner. To achieve the same result for /c-TC-Spanner, it 
suffices to run the algorithm on the transitive-closure of the input digraph. Missing proofs are in Appendix B. 

Theorem 2.1. For any (not necessarily constant) k > 2, there is a deterministic polynomial-time algorithm 
achieving an 0{{nlogny^^^'^)-approximation for DmECTED A;-Spanner. 

We start by formulating the problem as an integer program. We briefly explain the problems with this 
approach and the ideas required to make it work. One can introduce binary edge variables Xe for each edge e 
in the transitive closure, and binary path variables yp for each path P of length < kin the transitive closure. 
One enforces the constraints yp < Xg for each e G P, which allow a path P in the spanner only if all edges 
along it are present. The final constraint is J2p Up ^ ^ for all edges (u, v) G G, where the sum is over paths 
P of length < k from uto v. Finally, one can relax the problem to an LP, and try to round the solution. 

The first problem is that the integrality gap is huge, which may be why an LP approach had not been 
considered before. Indeed, if there are n paths of length at most k between u and v, the LP might assign 
each of them a value of 6(l/n). However, we observe that if there are r = n^"^/*^ distinct paths from u to 
V of length < k, there must be > r^/(''~^) distinct vertices w for which u v. Let BFS{v) denote a 

shortest path tree of edges directed away from v, together with a shortest path tree of edges directed towards 
V. Then if we sample 0{n/r^/^''~^^) vertices, and grow BFS{w) of 2{n — 1) edges around each sample w, 
we will sample a w for which w ^ v, and the path from utov along the edges in BFS{w) has length 
at most k. We take the spanner H to be the union of the outputs of the LP and sampling-based algorithms. 

2. For each edge e G G, if Xe > t-, — ^t— ttf. H ^ H D |e|. 

3. Randomly sample r = ©((nlogn)^"^/*^) vertices zi, Z2, ■ ■ ■ ,Zr E G. 

4. H^HU {UiBFSjzi)). Output H. 

With high probability, an edge {u, v) is covered by either the LP relaxation or the sampling. 

Lemma 2.2. With probability at least 1 — 1/n, H is a k-TC-spanner ofG. 

2 

The spanner has at most r ■ OPT + ^i/lk-i) edges, where OPT is the optimum of the LP. By ob- 
serving that any spanner must have size min(OPr, n — 1), one can guarantee that this is an 0(n^~^/'^)- 
approximation. Note that we assume that G is connected, as otherwise we can run the algorithm sepa- 
rately on each component. A more careful analysis gives an 0((n log n)^~^/'^)-approximation, and a simple 
greedy algorithm derandomizes the sampling. 

Lemma 2.3. \H\ = 0((n\ogn)^'^/''OPT). 

The problem with this approach is that the number of variables and the size of each of the constraints 
grows exponentially with k. We replace the variables yp with minggp Xe, reducing the number of variables 
to O(n^). The resulting program is convex, and we use the ellipsoid algorithm with a separation oracle. The 
oracle, given x, just needs to find one pair of vertices (m, v) for which the constraint Xlp u^i; ™™ee p^e > I 
is violated. It can do this by sorting the coordinates of x, and counting the number of u-v paths P for which 
some particular Xg is the minimum edge variable along P. For this, it iteratively removes edges e from G 
for which Xe is smallest, and uses matrix multiplication to count the u-v paths that remain in the graph. 

Lemma 2.4. For any k, there exists a separation oracle which runs in time poly(n). 

Our 0(n/A;^) -approximation algorithm, which is specific to /c-TC-Spanner, works by sampling 0{n/k) 
vertices and selectively including 0{n/k) edges in the transitive closure adjacent to the samples. We also 
include the edges of TR{G) in the spanner. A simple greedy algorithm derandomizes the sampling. 

Theorem 2.5. For any k, there exists a deterministic approximation algorithm for the fc-TC- Spanner 
problem with approximation ratio 0{{n log n) / (A;^ -|- k log n)). 
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3 Overview of Hardness Results for /c-TC-Spanner 



This section outlines the proof of Theorem 3.1, which is our main technical contribution. Missing details 
are in Appendix C. At the end we briefly describe the ideas behind the inapproximability result for 2-TC- 
Spanner that appears in Appendix D. 

Theorem 3.1. For any fixed e G (0, 1), the size of the sparsest k-TC-spanner cannot be approximated to 
within a factor of 2^°^^'"'' unless NP C DT/ME(nP°iy'°g"). 

3.1 The Construction and its Motivation 

Since /c-TC-Spanner is a special case of DIRECTED fc-SPANNER, which is 0(logn)-inapproximable for 
k = 2 and 2^°^ " "-inapproximable for > 3, it is natural to ask whether the hard instances of DIRECTED 
A;-Spanner from [35, 21, 24] can be used to prove hardness for fc-TC-SPANNER. It turns out that all 
these instances have very small fc-TC-spanners. We demonstrate it for the instance used in the proof of 
ri(log n)-hardness for Directed A:-Spanner, which works via a reduction from Set-Cover. 

Let G be a bipartite digraph for Set-Cover with n vertices ("sets") on the left, n vertices ("elements") 
on the right, and edges from left to right. Let / be a set of i new independent vertices, for some value i, and 
let L be a directed line on A; — 1 new vertices. Call the first vertex of L the head, and the last vertex the tail. 
Include directed edges (1) from the tail of L to every set in G, (2) from every vertex of / to the head of L, 
and (3) from every vertex of I to the sets and the elements of G. Call the constructed digraph G'. 

Observe that in G', all directed edges except those from I to G must be included in the directed k- 
spanner, as such edges form the unique path between their endpoints. At this point, the only pairs of vertices 
at distance larger than k are those from a vertex in / to an element of G. Since these vertices are adjacent 
in G', there must be a path of length at most k in the spanner. The only possible path is from the vertex in / 
to a vertex of G. It is easy to see that adding exactly OPT edges from each vertex in J to the sets of G is 
necessary and sufficient to obtain a spanner, where OPT is the size of the minimum set-cover. By making 
i sufficiently large, the size of the spanner is easily seen to be 0(i • OPT), and thus one can approximate 
Set-Cover by approximating Directed /c-Spanner, so the problem is ri(log n)-inapproximable. 

However, there is a trivial fc-TC- spanner for this instance! Indeed, by transitivity we can simply connect 
the head of L to each of the elements of G. This is a /c-TC-spanner of size proportional to the number 
of vertices in G'. Thus, the best one could hope for with this instance is to show n(l)-hardness for k- 
TC-Spanner. For similar reasons, the instance showing 2^°s ^"-inapproximability for DIRECTED k- 
Spanner also cannot estabhsh anything beyond 0(l)-hardness for A:-TC-Spanner. 

In the example above there are many paths to cover (those from / to elements of G), but a few "shortcut" 
edges cover them all. Ideally, we would have many paths to cover, and each shortcut edge could only cover 
a single path. Hesse's digraph requiring a large number of shortcuts to reduce its diameter [33] satisfies the 
desired condition. His idea was to associate vertices with a subset V of vectors in such that {u, v) & E 
iff n — v is an extreme point of the d-dimensional ball of integer points. By the properties of an extreme 
point, a shortcut can cover at most one path from a large family of shortest paths. 

However, to achieve an inapproximability result, we need better structured graphs. We use generalized 
butterflies defined in [53]. In these digraphs vertices are identified with coordinates [n^/'^]*^ x [fe + 1], and 
an edge connects u= (-ui, . . . , rtfc, i)\.ov = {v\, . . . , 'Ufc, z -I- 1) iff for all j ^ i, uj = vj. We say a vertex 
(ui, ... ,Uk,i) is in strip i. It is easy to see that there is a unique shortest path of length k from any u in 
strip 1 to any v in strip k + 1. Moreover, any shortcut is on at most n^^^/'^' such paths because if it connects 
a vertex in strip i with a vertex in strip i + i (where ^ > 2) it fixes all but i — 1 coordinates of u and all but 
k + 1 — {i + i) coordinates of v. Thus, > n^+^Z*^ shortcuts are needed to reduce the diameter to — L 
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Reduction from MIN-REP. To get 2^°§ "-inapproximability, we reduce from the MIN-REP prob- 
lem. An (n, r, d, m)-MIN-REP instance is a bipartite graph of maximum degree d in which the left part 
can be partitioned into sets Ai,. . . ,Ar and the right part into sets Bi,. . . , Br, so that \Ai\ = \Bi\ = n/r 
for all z G [r]. To describe the last parameter m, call a vertex isolated if its degree is 0, and non-isolated 
otherwise. Let m{Ai) be the inverse of the fraction of non-isolated vertices in Ai. Then m is the mini- 
mum such m{Ai). Define the supergraph to have nodes Ai, . . . ,Ar,Bi, . . . ,Br, with a superedge {Ai, Bj) 
iff there is a node in Ai adjacent to a node in Bj. A rep-cover is a vertex set S in the graph such that 
whenever [Ai, Bj) is an edge in the supergraph, there is an edge between some u,v e S with u e Ai and 
V & Bj. A solution to MIN-REP is a smallest rep-cover, and its size is denoted by OPT. The problem is 
2i°s'"'"-inapproximable [21]. 

As a first attempt, we construct a graph of diameter -|- 2 as follows. We attach a disjoint copy of a 
generalized butterfly of diameter A; — 1 to each Ai in the MIN-REP instance graph; that is, we identify 
the vertices in Ai with the last strip of the butterfly. We call the vertices in the butterfly at distance x from 
Ai the x-th shadow of Ai. Next, for each Bj, we attach what we call a broom. This is a 3-layer graph, 
where the two leftmost layers form a bipartite clique, and the right layer consists of degree- 1 nodes, called 
broomsticks, attached to vertices in the middle layer. Each vertex in the middle layer has the same number 
of broomsticks attached to it. Each Bj is identified with the left layer of a disjoint broom. All edges are 
directed from the shadows of the Ai towards the broomsticks (left to right). Call the resulting digraph G. 

We would like to argue that the minimum fc-TC-spanner if of G is formed as follows. Let 5 be a 
minimum rep-cover of the underlying MIN-REP instance. For each s G 5, if s is in an Ai, include all 
shortcuts from the 2-shadow of Ai to s which are in the transitive closure of G. Otherwise (s is in a Bj), 
include all shortcuts from s to the broomsticks of Bj . By balancing the number of broomsticks with the size 
of 2-shadows, one can show H has size \S\f{n, k), where /(n, k) is an easily computable function. Since 
S" is a rep-cover, H is a fc-TC-spanner. If H were optimal, then approximating its size within some factor 
would approximate MIN-REP within the same factor. 

It turns out that H is not optimal, and so our first attempt does not work. However, by modifying G 
via the transformations below, and by looking at a related fc-TC-spanner H of the modified G, we can 
show that any fc-TC-spanner must have size Q,{\H\/ logn) for constant k. Since MIN-REP is 2'°^ 
inapproximable, this still gives 2'°^ " "-hardness. 

To prove this, we need to argue that most vertices v in the A;-shadows do not "benefit" from traversing 
other shortcuts to reach the broomsticks. This requires a classification of all alternative routes from such v 
to broomsticks. Given that u is in a generalized butterfly, these routes are well-understood. However, for a 
generic MIN-REP instance, most of these routes do indeed lead to a much smaller A;-TC-spanner! 

To rule out the alternative routes, we ensure that OPT and the four parameters of the MIN-REP instance 
each lie in a narrow range. In Theorem 3.2, we prove that MIN-REP with the required parameter restrictions 
is inapproximable by giving a reduction from an unrestricted MIN-REP instance. It works by carefully 
interleaving the following five operations on a "base" MIN-REP instance with unrestricted parameters: (I) 
disjoint copies, (2) dummy vertices inside clusters, (3) blowup inside clusters with matching supergraph, (4) 
blowup inside clusters with complete supergraph, and (5) tensoring. Each operation increases one or several 
parameters by a prespecified factor, and together they give us five degrees of freedom to control the range 
of OPT and the four parameters of MIN-REP. 

Theorem 3.2 (Noise-ResUient MIN-REP is hard). Fix any k G (0, 1) and R, D, M, F e {0,1 - k) 

satisfying F G {R, 2R) and D + M + F < 1. Noise-Resihent MIN-REP is a family of{n, r, d, m)-MIN- 
REP instances with r G [n^,n^'+% d G [n^,n^+% m G [n^ ,n'^+% and OPT G [n^,n^+% This 
problem is 2^''^''' -inapproximable for all e G (0, 1) unless NP C DTIME{'nP°^y^°^''). 

The variant of MIN-REP in Theorem 3.2 is called "noise-resilient" because even if many vertices in 
the sets Ai and Bj are adversarially deleted in an instance of this problem, the minimum rep-cover does 
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Figure 1 : The TC-spanner instance Q, and an example of a broom. 



not shrink significantly. This property helps us rule out many alternative routes in the TC-spanner, though 
we will need to change our graph G. Our reduction from noise-resilient MIN-REP to fc-TC-SPANNER for 
k > 2 consists of two steps: first we produce a specialized MIN-REP instance I from an arbitrary instance 
To of noise-resilient MIN-REP, and then we construct a fc-TC- Spanner instance Q by carefully adjoining 
generaUzed butterfly graphs on the left and broom graphs on the right of X. 

From Noise-resilient MIN-REP to Specialized MIN-REP. Set d = Ifl, V = 2(4fc-4)(4fc-2) ' 

C = ^ (ll^ + 4fe^) • be a sufficiently small positive constant which will be chosen in the course of 
the proof. We start from an (no, tq, do, mo)-instance Tq of noise-resilient MIN-REP with optimum OPTq, 
where no = n^ro G [n^/^,n^/^+''],do G [n'^,n'^+% mo G [n^'>,n^'^+% and OPTq G [n^,n^+'"]. By 
instantiating Theorem 3.2 with R= \,D = ^,M = F = ^ and k, we obtain that the (no, ro,do, mo)- 
MIN-REP problem is 2'°^^-^ n.^jj^ppj-Qximable unless NP C DTIME{nP°^y^°s''). The conditions on the 
parameters in Theorem 3.2 are satisfied because C G (f , (^) and ij + 2ri + ( < 5. 

We transform Xq to a specialized (n, r, d, m)-MIN-REP instance X by applying onXo the transformation 
T4 defined in the proof of Theorem 3.2 in Appendix C. More precisely, set X = r4(Xo, n^~^). By definition 
of T4, graph X has n vertices, r = ro, d = don}~^ and m = mo. The transformation results in a bipartite 
graph X with nodes partitioned into clusters .4i , . . . , .A,- on the left, and Bi, ... ,Br on the right. Each Ai 
and Bj is a union of n^~^ groups Ai^s and Bj^s, respectively, with s G [n^~^]. Each group Ai^s and Bj^s, for 
i,j G [r], s G [n^~^], is a copy of Ai and, respectively, Bj, from the original instance Xo. For each edge 
{u, v) with u e Ai and v G Bj of Xo, graph X has edges between the copy of u in Ai^^i and the copy of v in 
Bj^k2' for all ki,k2 G [n^""^]. This completes the description of the specialized MIN-REP instance X. 

From Specialized MIN-REP to A;-TC- Spanner. From X, we construct a graph G of diameter /c + 2 as 
follows. We first attach a disjoint generalized butterfly of diameter A; — 1, denoted BF{Ai^s)^ to each group 
Ai^s in 2r, for all i G [r], s G [n^""^]. That is, we identify vertices in Ai^g with the last strip of BF{Ai^s) 
in the way discussed below. Denote by BF{Ai) = UsBF{Ai^s) the set of all the vertices attached in 
this manner to the cluster Ai. Let BF^{Ai^s) be the vertices in strip j of the butterfly BF{Ai^s), where 
BF^{Ai^s) = A,s, and let BF^{Ai) = UsBF^{Ai^s). We call the vertices in the butterfly BF{Ai^s) at 
distance x from Ai^g the x-th shadow of Ai^g- Call the in-degree as well as out-degree of the vertices in the 
butterflies = (^)fc^. 

Next, for each Bi^s, we attach a broom, denoted BR{Bi^s). More specifically, each vertex in Bi^g is 
connected to the vertices of a set BR^~^'^{Bi^s) of size d^, and each vertex v G BR^^'^{Bi^s) is connected 
to a disjoint set of nodes, called broomsticks, of size d*. Let BR^^^{Bi^s) be the set of broomsticks 
adjacent to 5i?'=+2(Bi,s). "Lei BR^+'^{Bi) = UsBR^+'^{Bi^s) 3sidBR^+^{Bi) = UsBR^+^{Bi^s). Identify 
layer Vj with Ui^gBF^ {Ai^s) for j G [k], layer Vk+i with Ui^gBi^s, and layer Vj with \JiBR^{Bi) for 
J G {fc + 2, + 3}. Direct all the edges from Vi to Vi+i. See Figure 1. 

Attacliing butterflies. Recall that we identify vertices in Ai^s with the last strip BF^{Ai^s) of a disjoint 
butterfly, for all i G [r], s G [n^"'']. The mapping from 5 to BF^{Ai^s) is constructed in Appendix C. 
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Here we explain the requirements we impose on the mapping. Recall that each each group Ai^g has < ^ 
non-isolated vertices. For our analysis, each vertex in BF'^^^^Ai^s) must be adjacent to < ^ non-isolated 
vertices in Ai^g- The isolated vertices help us control the number of routes with shortcut edges from the 
x-shadows to the (x — 2) -shadows, for some x > 2, since connecting vertices in the 1-shadow to many 
isolated vertices decreases the number of comparable pairs in the first and last layers of Q connected by a 
path containing such a shortcut edge. 

A sparse TC-spanner H for the k-TC Spanner instance Q. Let Sq be a smallest rep-cover of Iq 
of size OPT. Recall that each Ai and Bj is replicated n^~^ times in I. Let S be the set of all replicas in I 
of vertices in Sq. Consider a A;-TC-spanner Hof Q that contains shortcuts from the nodes in layer Vk-2 to 
their descendants in 5 n V^, and from the nodes in 5 H V^+i to their descendants in The Rep-cover 

Spanner Lemma (Lemma C.2) shows that \n\ = 0{OPT ni-^(^)*^). 

3.2 Path Analysis and Rerandomization 

The next lemma shows that the A:-TC-spanner H defined above and analyzed in Lemma C.2 is nearly optimal. 

Lemma 3.3. Any k-TC-spanner K ofQ has |/C| = n {oPT-n}-^ (^^) '"^ / log . 

We introduce a bit of notation. A fc-TC-spanner iovQ = Vi U V2 U • • • U V^+a is built by adding shortcut 
edges (u, v) between comparable u and v, where u ^ Vi,v e ViJ^e, and £ > 2. For given £, i, we classify 
such a shortcut edge as type iSzi. Since Q has diameter A; -I- 2, a fc-TC-spanner for Q remains a A;-TC-spanner 
when a type i&ii edge {u, v) with ^ > 4 is replaced by a type 3&i edge (u, v'), where v' is a predecessor of v. 
Therefore, it is enough to consider fc-TC-spanners with shortcut edges only of types 2Szi for 1 < i < /c -f- 1 
and 3&i for 1 < i < A:. Say a path vr from Vi to is of type {i&ii) if it uses an edge of type iHii (with 
£ G {2, 3}), and tt is of type (2&;z, 2S^,j) if it uses edges of types 2Szi and 28z.j, i < j. Notice that the 
A;-TC-spanner constructed in Lemma C.2 contains only edges of type 2&(A; — 2) and 2&L{k + 1). 

Proof of Lemma 3.3. Given a /c-TC-spanner IC of Q with o ( j^^^* 1 OPT edges, we show that we can 

6 1 

construct a MIN-REP cover for I of size o{OPT), which is a contradiction (recall that = (^) ^-i ). 
We will accomplish this by a series of transformations which modify /C into a A;-TC-spanner that uses only 
shortcut edges of the form 2Sz{k — 2) and 2Sz{k + 1). The process increases the size of the fc-TC-spanner 
only by a logarithmic factor. Finally, we show that from the modified fe-TC-spanner, one can extract a 
MIN-REP cover of size o{OPT) for X, the desired contradiction. 

We call a superedge {Ai,Bj), where i,j G [r], deletable with respect to K, if at least 1/4 of the vertex 
pairs {u, v) G BF^{Ai) x BR^^^{Bj) have a path between them in K, of length at most k and of type other 
than {2Sz{k — 2), 2Sz{k + 1)). Our first step is to show that such cluster pairs can be essentially ignored. 

Lemma 3.4 (Patli Analysis Lemma). The number of deletable superedges with respect to IC is o{OPT). 

Proof Sketch. We call a path canonical if it contains shortcut edges of types both 2Sz{k — 2) and 2Sz{k + 1) ; 
otherwise, a path is alternative. Observe that every alternative path contains a shortcut edge from one of 
the following three categories: (1) edges that connect vertices in Vi and Vj, where i < k and j > A: + 1; 
(2) edges of type 3Szi where i < k — 3; (3) edges of type 2&;z where i < k — 3. Let Sb be the set of all 
shortcut edge types included in the three cases. By analyzing the three cases separately, we show that for 
any S G Sb, the number of superedges {Ai, Bj), G [r]^, such that at least a fraction of pairs 

{u, v) G BF'^{Ai) X BR''+^{Bj) have an alternative path containing a shortcut of type S, is o{OPT). Then 
by a union bound over S ^ SB,v/e prove the lemma. In the analysis for the first type, we use the fact that the 
degree of each non-isolated vertex of Vk is at least n^~^ which is bigger than d*. When S is of the second 
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type, we need the facts that the out-degree of each vertex in T4 is at most don^~^ and that n"^ = o{d^). For 
the third case, we use the facts that for any vertex v in Vfe_i the number of non-isolated vertices in Vk that v 
is connected to is at most — , and that = oiin?'^'). □ 

Next, form the graph Q' from Q by deleting all edges of Q connecting Ai to Bj, for all the deletable 
superedges {Ai,Bj) with respect to /C. Similarly, obtain a graph K,' from /C as follows: for all deletable 
superedges {Ai, Bj) with respect to /C, delete all edges of /C connecting Ai to Bj, and also delete all shortcuts 
in /C of types other than 2k,{k — 2) and 2&:(A; + 1). Note that for any cluster pair {Ai,Bj) of Q' , either there 
are no edges between vertices in Ai and Bj or at least | of the pairs in BF^{Ai) x BR^~^^{Bj) are connected 
by a canonical path. Also define a MIN-REP instance J' from 1 by deleting all edges in Z corresponding 
to all the deletable superedges with respect to /C. 

For ^ e [0, 1], we say a subgraph of TC{Q) is a ii-good k-TC-spanner for Q if for every (i, j) G [r]^ 
such that and Bj are comparable in C/, at least a ^ fraction of pairs {u, v) G BF^{Ai) x BR^^^{Bj) are 
connected by canonical paths in the subgraph. E.g., the graph K,' is a |-good fc-TC-spanner for Q. 

Lemma 3.5 (Rerandomization Lemma). If a \-good k-TC-spanner JC' for Q' is given, then there exists 

IC", a 1-good k-TC-spanner for Q' , such that |/C"| < 0(|/C'| • logn). 

Proof Sketch. To construct K," from K,' , we let K," be the union of 0(log n) random transformations of the 
edges of K.'. Each transformation IIj. will keep the edges of Q' invariant but move the shortcut edges. Thus, 
when we let K." = u2\°^"^nr(/C'), the edges of /C" are still a subset of the edges in TC{g'). The goal of 
the random transformations is to ensure that in Ilr{}C'), with a constant probability, each vertex in BF^{Ai) 
can reach a vertex in Vk-2 incident to a shortcut edge, and each vertex in BR^^^{Bj) is incident to a shortcut 
edge from 14+1. We achieve this by randomly permuting the groups inside the clusters Ai and Bj and by 
randomly permuting the edges of the butterfly and broom graphs attached to Ai and Bj. After these random 
transformations, any two vertices u and v in BF^{Ai) and BR^^^{Bj) are connected by a canonical path 
with probability at least j^. Hence, K," has such a path between them with probability 1 — po\y{n) ■ 
union bound over all possible {u, v) and (i, j) shows that the desired /C" with the claimed size exists. □ 

Now that the fc-TC-spanner is 1-good, it is easier to reason about rep-covers of the underlying MIN- 
REP instance. Recall that Q' has n^~^ copies of MIN-REP instance I' embedded in it. Moreover, many 
pairs of vertices in layers Vi and rely on each instance to connect. We partition the shortcut edges of 
JC" into n^~^d1 parts, according to which groups of vertex pairs in Vi x Vk+3 they can help to connect. By 
averaging, one of the parts has o{OPT) shortcut edges, and can be used to extract a rep-cover of I' of size 
o{OPT). By including two vertices for each of the o{OPT) deleted superedges, we obtain a rep-cover for 
T of size o{OPT). This is a contradiction. 

Lemma 3.6 (Rep-cover Extraction Lemma). Given K,", a 1-good k-TC-spanner for Q', of size o{OPT ■ 
n^-^ ■ dl), there exists a MIN-REP cover of I of size o{OPT). □ 

Our fi(log n)-inapproximability for 2-TC-Spanner, described in Appendix D, is based on a reduction 
from Set- Cover instead of MIN-REP. Our hard instance is a generalized butterfly of diameter 2 attached 
to an instance of transformed Set-Cover. We identify strip 3 of the butterfly with the sets in the instance, 
and using ideas similar to our proof for A; > 2 for ruling out alternative routes, show that up to a constant 
factor, the optimal 2-TC-spanner contains only shortcuts from strip 1 to a minimum set-cover in strip 3. 

4 Overview of Structural Results 

In [28], the authors implicitly give 2-TC-spanners for planar digraphs of size 0(n^/^ logn) using Lipton- 
Tarjan separators. For planar digraphs, our first idea is to instead use Thorup's planar separators [50] in 
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conjunction with the efficient /c-TC-spanners for the directed line of Alon and Schieber [5] to recursively 
construct A;-TC-spanners of size O(nlog^n). More generally, for //-minor-free graphs, using an idea in 
[50], we take an arbitrary rooted spanning tree T of the digraph G and use it to partition G into a union of 
edge-disjoint digraphs so that in each part Gi, if one undirects the edges of Gi, any undirected root path of 
T restricted to Gi is the union of at most two dipaths. Next, instead of Thorup's planar separators, we use a 
result of Abraham and Gavoille [1] that provides a "path separator" for undirected i^-minor-free graphs. 

However, the Abraham-Gavoille separators cannot be directly applied, since they do not provide enough 
flexibility in the structure of the separators. That is, these separators consist of a sequence of unions of 
minimum cost paths, where the cost function on the edges is arbitrary but specified in advance. We, however, 
need to adaptively change the cost function during the construction of the separator. Indeed, in the outermost 
level of recursion we need the path separator to lie on T, as otherwise the path separator may be the union of 
r2(n) dipaths in the underlying digraph, and therefore we cannot use the efficient fc-TC-spanner of Alon and 
Schieber [5] for the directed line in order to efficiently recurse. Thus, we specify the cost of an edge in T to 
be 1, while outside of T it is oo. However, when we partition G into subgraphs in the recursion, it may be 
that two vertices in the same subgraph no longer have a path contained in T. Since the cost function is fixed 
and the cost of any path between these two vertices is now oo, a path separator in the recursive step need 
not be contained in T, and so it may not be the union of a small number of dipaths. Thus, we again cannot 
efficiently recurse. If, however, we could change the cost function in the recursive step, we could define 
a new rooted tree in each subgraph and base our cost function on that. We observe that the proof of the 
Abraham-Gavoille separators can be used to show that their path separators satisfy this stronger property. 

Theorem 4.1. IfG is an H-minor-free graph, then it has a 2-TC-spanner of size 0(n log^ra) and, more 
generally, a k-TC-spanner of size 0{n - log n ■ Xk{n)) where Afc(-) is the k-row inverse Ackermann function. 

Acknowledgments. We would like to thank Michael Elkin, Vitaly Feldman, Cyril Gavoille, Piotr Indyk, 
T.S. Jayram, Elad Hazan, Ronitt Rubinfeld, and Adam Smith for helpful discussions. 

References 

[1] I. Abraham and C. Gavoille. Object location using path separators. In PODC, pages 188-197, 2006. 

[2] W. Ackermann. Zum Hilbertshen aufbau der reelen zahlen. Math. Ann., 99:118-133, 1928. 

[3] A. V. Aho, M. R. Garey, and J. D. UUman. The transitive reduction of a directed graph. SIAM J. 
Comput., 1(2):I31-I37, 1972. 

[4] N. Ailon and B. Chazelle. Information theory in property testing and monotonicity testing in higher 
dimension. Inf Comput., 204(1 1):I704-17I7, 2006. 

[5] N. Alon and B. Schieber. Optimal preprocessing for answering on-line product queries. Technical 
Report 71/87, Tel- Aviv University, 1987. 

[6] I. Althofer, G. Das, D. Dobkin, D. Joseph, and J. Soares. On sparse spanners of weighted graphs. 
Discrete & Computational Geometry, 9(1):8I-100, 1993. 

[7] M. Atallah, M. Blanton, and K. Frikken. Key management for non-tree access hierarchies. In ACM 
Symposium on Access Control Models and Technologies (SACMAT'06), pages 11-18, June 2006. 

[8] M. J. Atallah, K. B. Frikken, and M. Blanton. Dynamic and efficient key management for access 
hierarchies. In ACM Conference on Computer and Communications Security, pages 190-202, 2005. 



II 



[9] S. Baswana and S. Sen. Approximate distance oracles for unweighted graphs in O(n^) time. In 
SODA '04: Proceedings of the fifteenth annual ACM -SI AM symposium on Discrete algorithms, pages 
271-280, Philadelphia, PA, USA, 2004. Society for Industrial and Applied Mathematics. 

[10] T. Batu, R. Rubinfeld, and P. White. Fast approximate PCPs for multidimensional bin-packing prob- 
lems. In Proceedings of RANDOM, pages 245-256, 1999. 

[11] P. Berman and M. Karpinski. On some tighter inapproximabiUty results. Technical Report 85193-CS, 
DIMACS, 1998. 

[12] B. Chazelle. Computing on a free tree via complexity-preserving mappings. Algorithmica, 2:337-361, 
1987. 

[13] C. Chekuri, G. Even, A. Gupta, and D. Segev. Set connectivity problems in undirected graphs and the 
directed steiner network problem. In SODA, 2008. 

[14] E. Cohen. Fast algorithms for constructing t-spaimers and paths with stretch t. SIAM J. Comput., 
28(l):210-236, 1998. 

[15] E. Cohen. Polylog-time and near-linear work approximation scheme for undirected shortest paths. J. 
ACM, 47(1):132-166, 2000. 

[16] L. Cowen. Compact routing with minimum stretch. J. Algorithms, 38(1): 170-183, 2001. 

[17] L. Cowen and C. G. Wagner. Compact roundtrip routing in directed networks. J. Algorithms, 50(1):79- 
95, 2004. 

[18] Y. Dodis, O. Goldreich, E. Lehman, S. Raskhodnikova, D. Ron, and A. Samorodnitsky. Improved test- 
ing algorithms for monotonicity. In D. S. Hochbaum, K. Jansen, J. D. P. Rolim, and A. Sinclair, editors, 
RANDOM-APPROX, volume 1671 of Lecture Notes in Computer Science, pages 97-108. Springer, 
1999. 

[19] Y. Dodis and S. Khanna. Designing networks with bounded pairwise distance. In STOC, pages 750- 
759, 1999. 

[20] M. Elkin. Computing almost shortest paths. In PODC, pages 53-62, 2001. 

[21] M. Elkin and D. Peleg. Strong inapproximabiUty of the basic fc-spanner problem. In Proceedings of 
the 27th International Colloquim on Automata, Languages and Programming, 1999. 

[22] M. Elkin and D. Peleg. The client-server 2-spanner problem with applications to network design. In 
F. Comellas, J. Fabrega, and P. Fraigniaud, editors, SIROCCO, volume 8 of Proceedings in Informatics, 
pages 117-132. Carleton Scientific, 2001. 

[23] M. Elkin and D. Peleg. Approximating fe-spanner problems for k > 2. Theor Comput. ScL, 337(1- 
3):249-277, 2005. 

[24] M. EMn and D. Peleg. The hardness of approximating spanner problems. Theory of Computing 
System, 41(4):691-729, 2007. 

[25] F. Ergun, S. Kannan, S. R. Kumar, R. Rubinfeld, and M. Viswanathan. Spot-checkers. JCSS, 
60(3):717-751,2000. 



12 



[26] U. Feige. A threshold of In n for approximating set cover (preliminary version). In STOC, pages 
314-318, 1996. 

[27] E. Fischer. On the strength of comparisons in property testing. Inf. Comput., 189(1): 107-1 16, 2004. 

[28] E. Fischer, E. Lehman, I. Newman, S. Raskhodnikova, R. Rubinfeld, and A. Samorodnitsky. Mono- 
tonicity testing over general poset domains. In STOC, pages 474^83, 2002. 

[29] C. Gavoille. Personal Communication, November 2007. 

[30] O. Goldreich, S. Goldwasser, E. Lehman, D. Ron, and A. Samorodnitsky. Testing monotonicity. Com- 
binatorica, 20(3):301-337, 2000. 

[31] O. Goldreich, S. Goldwasser, and D. Ron. Property testing and its connection to learning and approx- 
imation. JACM, 45(4):653-750, 1998. 

[32] S. Halevy and E. Kushilevitz. Testing monotonicity over graph products. In J. Diaz, J. Karhumaki, 
A. Lepisto, and D. Sannella, editors, ICALP, volume 3 142 of Lecture Notes in Computer Science, 
pages 721-732. Springer, 2004. 

[33] W. Hesse. Directed graphs requiring large numbers of shortcuts. In SODA, pages 665-669, 2003. 

[34] D. Hochbaum, editor. Approximation Algorithms for NP-hard Problems. PWS PubUshing Company, 
Boston, 1997. 

[35] G. Kortsarz. On the hardness of approximating spanners. In Proceedings of the 1st workshop on Ap- 
proximation Algorithms for Combinatorial Optimization problems, pages 135-146. Springer- Verlang, 
New York, BerUn, 1998. 

[36] G. Kortsarz. On the hardness of approximating spaimers. Algorithmica, 30(3):432^50, 2001. 

[37] G. Kortsarz and D. Peleg. Generating sparse 2-spanners. J. Algorithms, 17:222-236, 1994. 

[38] C. Lund and M. Yannakakis. On the hardness of approximating minimization problems. In STOC, 
pages 286-293, 1993. 

[39] D. Peleg. Distributed computing: a locality-sensitive approach. Society for Industrial and Applied 
Mathematics, Philadelphia, PA, USA, 2000. 

[40] D. Peleg and A. A. Schaffer. Graph spanners. Journal of Graph Theory, 13(1):99-116, 1989. 

[41] D. Peleg and J. D. UUman. An optimal synchronizer for the hypercube. SIAM J. Comput, 18(4):740- 
747, 1989. 

[42] D. Peleg and E. Upfal. A trade-off between space and efficiency for routing tables. J. ACM, 36(3):510- 
530, 1989. 

[43] R. Raz and S. Safra. A sub-constant error-probability low-degree test, and a sub-constant error- 
probability pep characterization of np. In STOC, pages 475-484, 1997. 

[44] L. Roditty, M. Thorup, and U. Zwick. Roundtrip spanners and roundtrip routing in directed graphs. In 
SODA, pages 844-851, 2002. 

[45] R. Rubinfeld and M. Sudan. Robust characterization of polynomials with applications to program 
testing. SIAM Journal on Computing, 25(2):252-271, 1996. 

13 



[46] A. D. Santis, A. L. Ferrara, and B. Masucci. Efficient provably-secure hierarchical key assignment 
schemes. In MFCS, pages 371-382, 2007. 

[47] M. Thorup. On shortcutting digraphs. In WG '92: Proceedings of the 18th International Workshop on 
Graph-Theoretic Concepts in Computer Science, pages 205-211, London, UK, 1993. Springer- Verlag. 

[48] M. Thorup. Shortcutting planar digraphs. Combinatorics, Probability & Computing, 4:287-315, 1995. 

[49] M. Thorup. Parallel shortcutting of rooted trees. J. Algorithms, 23(1): 139-159, 1997. 

[50] M. Thorup. Compact oracles for reachability and approximate distances in planar digraphs. Journal 
of the ACM, 51:993-1024, 2004. 

[51] M. Thorup and U. Zwick. Compact routing schemes. In ACM Symposium on Parallel Algorithms and 
Architectures, pages 1-10, 2001. 

[52] M. Thorup and U. Zwick. Approximate distance oracles. J. ACM, 52(1): 1-24, 2005. 

[53] D. P. Woodruff. Lower bounds for additive spanners, emulators, and more. In FOCS, pages 389-398, 
2006. 

[54] A. C.-C. Yao. Space-time tradeoff for answering range queries (extended abstract). In STOC, pages 
128-136, 1982. 

[55] U. Zwick. Exact and approximate distances in graphs — A survey. Lecture Notes in Computer Science, 
2161:33+, 2001. 



A Missing Details from Section 1 

A.1 Previous Work on Other Related Problems 

Dodis and Khanna [19] study the problem of finding the minimum-cost subset of missing edges that can 
be added to a (directed) graph G, with costs and lengths associated to the missing edges, so as to ensure 
that that there is a path of length at most k between every pairs of nodes (not only those connected in G). 
Observe that /c-TC-Spanner is a special case of that problem: we can let G be the transitive reduction (see 
definitions below)of the input graph to /c-TC-Spanner, for all edges in the transitive closure of G set the 
length to 1 and cost to 1 , and for the remaining edges set the length to k and cost to 0. Given this instance, 
the algorithm of Dodis and Khanna will produce a fc-TC-spanner. However, the guarantee on the resulting 
/c-TC-spanner size is only < |G| + 0{OPTn log k), where OPT is the number of missing edges that need 
to be added. If |G| = OPT = 6(n), their algorithm may return a A;-TC-spanner with Q{n^) edges. Thus, 
in general, the resulting approximation ratio is no better than 0{n). Since their problem is more general, 
their hardness results do not apply to TC-spanners. 

Chekuri et [13] give an 0(p^/^"'"^)-approximation algorithm for the directed Steiner network problem 
where, given a digraph and node pairs {si,ti), . . . , {sp, tp), the goal is to connect all pairs with as few edges 
as possible. We can reduce fe-TC-SPANNER to this problem by specifying all comparable pairs of nodes 
in levels 1 and /c + 2 in the k + 1-extension of G (see definition 5.5 of [19]). However, their ratio is only 
0(n^+^) when p = r2(n^), and thus the resulting ratio for fc-TC-SPANNER is no better than 0{n). 
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A.2 Sparse 2-TC-spanners Imply Efficient Monotonicity Testers 

In this section we restate and prove Lemma 1.1, referred to in the introduction. The proof of the lemma 
explains how to use 2-TC-spanners to obtain efficient monotonicity testers. 

Lemma A.l. If a directed acyclic graph Gn has a 2-TC-spanner H with s{n) edges, then there exists a 
monotonicity tester on Gn that runs in time O (^^^^ ■ 

Proof. The tester selects ^i^Hl edges of the 2-TC-spanner H uniformly at random. It queries function / on 
the endpoints of all the selected edges and rejects if and only if one of the selected edges is violated by /, 
that is, f{x) > f{y) for an edge (x, y). 

If the function / is monotone on G„, the algorithm always accepts. The crux of the proof is to show that 
functions that are e-far from monotone are rejected with probability at least |. Let / : ^ R be a function 
that is e-far from monotone. It is enough to demonstrate that / violates at least ^ edges in H. Then each 
selected edge is violated with probability , and the lemma follows by elementary probability theory. 

Denote the transitive closure of G by TC{G). We say a vertex x G 14 is assigned a bad label by / if 
X has an incident violated edge in TC{Gn)', otherwise, x has a good label. Let V' be a set of vertices with 
good labels. Observe that / is monotone on the induced subgraph G' = {V', E') of TC{G). This implies 
([28], Lemma 1) that / can be changed into a monotone function by modifying it on at most \Vn — V'\ 
vertices. Since / is e-far from monotone, it shows that there are at least en vertices with bad labels. 

Every function that is e-far from monotone has a matching M of ^ violated edges in TC{G) [18]. We 
will establish an injection from the set of edges in M to the set of violated edges in H. For each edge 
(x, y) in the matching, consider the corresponding path from xXoy of length at most 2 in the 2-TC-spanner 
H. If the path is of length 1, (x, y) is the violated edge in H corresponding to the matching edge (x, y). 
Otherwise, let (x, z, y) be a path of length 2 in H. At least one of the edges (x, z) and (z, y) is violated, 
and we map (x, y) to that edge. Since M is a matching, all edges in M have distinct endpoints. Therefore, 
each edge in M is mapped to a unique violated edge in TC{G). Thus, the 2-TC-spanner H has at least ^ 
violated edges, as required. □ 

The fact that i/ is a 2-TC-spanner is crucial for the proof. If it was a fc-TC-spanner for fc > 2, the path 
of length k from x to y might not have any violated edges incident to x or y, even if /(x) > f{y). Consider 

G2n = {V2n,E) whcrC V2n = {xi, ■ ■ .,X2n},E = {(xi,X„) | i < n} U (x„,Xn+l) U {(x„+i,Xj) | j > 

n+l}.G is a 3-TC-spanner for itself. Now set f{xi) — 1 for i < n and f{xi) — otherwise. Clearly, 
this function is ^-fai from monotone, but only one edge, (x„, x^+i) is violated in the 3-TC-spanner. 

A.3 Partial Products in a Semigroup 

Chazelle [12] and Alon and Schieber also consider a generaUzation of the above problem, where the input is 

an (undirected) tree T with an element Si of a semigroup associated with each vertex i. The goal is to create 
a space-efficient data structure that allows to compute the product of elements associated with all vertices 
on the path from ito j, for all vertex pairs i,j in T. The generalized problem reduces to finding a sparsest 
/c-TC-spanner for a certain directed tree T' obtained from T by appending a new vertex to each leaf, and 
then selecting an arbitrary root and directing all edges away from it. A /c-TC-spanner for T' with s(n) edges 
yields a preprocessing scheme with space complexity s{n) for computing products on T with at most 2k 
queries as follows. The database stores a product s^j o • • • o Sy^ for each /c-TC-spanner edge {vi,vt+i) if the 
endpoints of that edge are connected by the path vi, - ■ ■ ,vt, vt+i in T'. Let LCA{u, v) denote the lowest 
common ancestor of u and v in T. Compute the product corresponding to a path from u to t; in T as follows: 
(1) if u is an ancestor of v (or vice versa) in T, query the products corresponding to the fc-TC-spanner edges 
on the shortest path from it to a child of v (from v to a child of u, respectively); (2) otherwise, make queries 
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corresponding to the /c-TC-spanner edges on the shortest path from LCA{u, v) to a child of u and on the 
shortest path from a child of LCA{u, v) nearest to u to a child of u. This gives a total of at most 2k queries. 

B Approximation Algorithms for A;-TC-Spanner and Related Problems 
B.l Algorithm for Directed /c-Spanner 

We give the algorithm for DIRECTED fc-SPANNER, which is a more general problem than /c-TC-Spanner. 
We then mention the extensions to other problems. 

Theorem B.l. For any (not necessarily constant) k > 2, there is a deterministic polynomial-time algorithm 

achieving an 0{{nlogn)^^^/'')-approximationfor DIRECTED fc-SPANNER. 

Proof. Consider the following integer programming formulation. Let OPT be the size of an optimal k- 
spanner of G. For each edge e in the input digraph G, we have a variable Xe indicating whether Xe occurs in 
the fc-spanner. Also, for each (not necessarily simple) path P containing at most k edges, we have a variable 
yp indicating whether all of the edges of P occur in the /c- spanner. 

s.t. Ve = {u,v) eG, Yl yp>'^ (1) 

Pfroiautov, \P\<k 

VP = (ei, e2, . . . , Cr), yp <ei, yp < e2, ■ . . ,yp < e^ (2) 
VeVP, Xe, ypG{0,l} (3) 

The first constraint ensures that there is at least one path of length at most k spanning each edge (n, v) in 
the spanner, while the second constraint only allows a path to be included if each of its edges is also in the 
spanner. Thus, any solution to this program is a A;-spanner, and vice versa. Notice, however, that the number 
of path variables grows exponentially with k. We can instead write this as as the following integer program: 

minEeeG^^e 

s.t. \/e = {u,v)eG, ^ -mm{Xei,Xe2, ■ ■ ■ ,Xe^) < -1 (4) 

P={ei,...,er,) frorauto V, |-P|<fe 

Ve, Xe e {0, 1} (5) 

Now the number of variables is m, where m is the number of edges of G. We relax the constraints Xe G 

{0, 1} to Xe G [0, 1]. The resulting set K we optimize over is convex since x G [0, 1]"^ and the functions 
— min(a;ei , ■ ■ ■ , Xe^.) are convex (as is their sum). We reduce the problem to a feasibility one by taking the 
convex set K' = K D {x : EeeG Xe ^t}, for a parameter t which we do binary search over. 

The problem is still that we sum over a number of terms which can be exponential in k. However, we 
design a separation oracle A which does the following: given a point x G M™, A decides whether x G K', 
and if not, provides an o G and 5 G M for which (a, x) < b but {a,y} >b for all y G K'. We will design 
an oracle for this task running in time poly(n). Later, we explain the details of algorithm A. 

There are several folklore polynomial-time algorithms for solving a convex program given a separation 
oracle A. We use the ellipsoid algorithm, which, given e > 0, runs in time poly(n) log ^ and, if the program 
is feasible, returns an x* for which the £2 -norm \x* — x\2 is at most e, where a' is a feasible solution. Setting 
e = n"®^') guarantees that x* G [0, 1]™, that Et-ec K < n"®^'') + E,,eG ^e, and for all e = (it, v) G G, 

^ -mm{x*e^,x*e^,...,x*e^) < n^c- ^ min(a:;ei , . . . , XeJ 

P=(ei,...,er) from « to I), |P|<fc P=(ei ,...,6^) fromn to d, |P|<fc 

< n-®«-l. 
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Assuming we have an oracle A described above, the following is our algorithm A:-Spanner Generation 
to construct a directed fc-spanner H of G. For a vertex v £ G, we use BFS{v) to denote the set of edges 
along a shortest path tree^ rooted at v. Clearly \BFS{v)\ = 0{n). 



fc-Spanner Generation(G): 

1. H^%. 

2. For each edge e € G, if x* > , , ,,, , H ^ HU {e}. 

3. Randomly sample r = 0((n log n)^"^/*^) vertices zi, Z2, ■ ■ ■ , Zr & G. 

4. H ^HU {UiBFS{zi)). Output 



Lemma B.2. W?Y/z probability at least 1 — 1/n, H is a k-TC-spanner ofG. 

Proof. Consider an edge {u, v) G G. Suppose there are at most (nlogn)^"^/*^ different u — v paths P of 
length at most k. By constraint (4) of the convex program and the relationship between x and x*, there exists 
suchaP= (ei, . . . ,er) forwhichmin(x*j, . . . > l/(nlog n)^~^/'^-n~®('^) > l/(2(nlogn)^~-^/'=), 
for some r < k. Thus, this path P is included in H in step 2 of A;-Spanner Generation. 

Now suppose there are more than (nlogn)^"^/'^ different u — v paths P of length at most k. Let 
= {wi^ . . . , u^s} be the set of vertices lying on at least one such path. The number oi u — v paths 
of length at most k that can be formed from s vertices is at most s^^^ . So, s^~^ = r2((nlogn)^~^/^), or 
s = r2((nlogn)^/''). The probability that {zi, 2:2, ... , Zr] n Wu,v = is at most (1 - s/nY < e"''*/'* < 
g-f2(iogn) ^ i/jT,^, for an appropriate choice of constants. 

By a union bound, with probability at least 1 — 1/n, all edges {u,v) G G for which there are more than 
(nlog n)^~^/^ different u — v paths P of length at most k satisfy {zi, Z2, ■ ■ ■ , Zr}r\ Wu,v 7^ 0- Conditioned 
on this event, for each such {u, v) £ G let z{u, 7;) be an arbitrary element in {zi, Z2, ■ ■ . , Zf} H Wu,v Then 
the path u ^ z{u,v) v along the edges of BFS{z{u,v)) is of length at most k. Indeed, there is a 
path P of length at most A; from uto v which contains z{u,v), and the path from uto v along the edges of 
BFS{z{u, v)) caimot be any longer than the length of P. □ 

Lemma B.3. \H\ = 0((n log n)^-^/'^OPT). 

Proof. Let OPT' be the optimum of the convex program. Clearly OPT' < OPT. In step (2) of k- 
Spanner Generation, at most 2(n log n)^~^/'=OPr' < 2(n log n)^"^/'=OPT edges are added to iJ. In 
step (4), 0{rn) = 0((nlogn)2-Vfc) edges are added to H. So, \H\ = 0((n log n)^-^'^) (OPT + n). We 
may assume that OPT > n — 1, as otherwise G is not connected and we can run fe-SPANNER GENERATION 
on each of its connected components. Therefore, \H\ = 0{{nlogn)^~^^^OPT). □ 

As \H\ > OPT, these lemmas show that /c-Spanner Generation isarandomizedO((nlogn)^"^/'^)- 
approximation algorithm for DIRECTED A;- Spanner. The algorithm can be derandomized by greedily 
choosing the Zi in step 3. 

Lemma B.4. For any constant k > 2, DIRECTED A-Spanner has a deterministic 0((n log n)^~^/^)- 
approximation algorithm. 

'For a directed graph, this means we take a shortest path tree of edges directed away from v, together with a shortest path tree 
of edges directed towards v. 
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Proof. In step (3) of /c-Spanner Generation, instead of sampling r random vertices, we do the follow- 
ing. For each edge (n, v) ^ G with more than (n log n)^~^/^ simple u — v paths P of length at most k, find 
the set Wu,v of all vertices lying on such a path between u and v. This can be done by computing BFS{w) 
for each vertex w e G, and checking if u w ^ v along the edges of BFS{w) is a path of length at 
most k. By averaging, there is a vertex zi which occurs in an 0((log n)^/'^/n^^^/^) fraction of the sets 
Wu,v Choose zi, delete the sets Wu,v containing zi, and repeat. This greedy algorithm finds zi,. . . ,Zr 
v/ithr = 0{{nlogn)^~^/^). □ 

The technique can also be extended to other spanners variants. 

Lemma B.5. For all constant k > 2, there are deterministic 0[{n\ogn)^^^^'^)-approximation algo- 
rithms for Client/Server Directed ^-Spanner, ^-Diameter Spanning Subgraph, and k-TC- 
Spanner. 

Proof. In the chent/server problem, we only wish to span a subset of edges of G, called client edges, and 
we may only use a subset of edges of G for spanning, called server edges. To modify our algorithm, we 
have a constraint in the linear program for each client edge rather than for all edges, and we only consider 
paths along server edges. In A;-DiAMETER Spanning Subgraph, all pairs of vertices (n, v) for which v 
is reachable from u need to be connected by a path of length at most k. For this we impose constraint (1) 
for all pairs rather than just all edges. Finally, A;-TC-Spanner is a special case of Directed A;-Spanner 
when the input is transitively closed. □ 

Moreover, A;-Spanner Generation is polynomial time provided that we can find x* in polynomial 
time. For this, it suffices to show that the running time of the separation oracle is polynomial. 

The separation oracle A first checks whether x G [0, 1]"^ and XleeG^e < t in poly(n) time, and 
provides an appropriate hyperplane if any of these constraints are violated. Assume, then, that all of these 
constraints are satisfied. Let Count(G, u,v,k) he an algorithm which outputs the number ofu — v paths of 
length at most k. The number ofu — v paths of length exactly i is just the {u, v)-ih entry of M*, where M 
is the adjacency matrix of G. Thus, we can implement Count(G, u, v, k) in poly(n) time. For a subset S of 
edges of G, Count(G \ S,u,v,k) counts the number ofu — v paths of length at most kinG which do not 
use the edges in S. 

The oracle sorts the coordinates of x, obtaining < < • • • < Xg^. Let = 0, and for z > 1, 

Si = Si-i U {cj}. The oracle computes Count(G \ Si,u,v, k) for all i > 0. From this information, for 
each i > 1 the oracle can extract Cj, the number of u — v paths of length at most k whose minimum is 
achieved by Xey Indeed, observe that Cj is just the number ofu — v paths of length at most kinG\ Sj-i 
minus the number ofu — v paths of length at most k in G\Sj. Algorithm A can now check if the constraint 
corresponding to (n, v) is satisfied, and it does this for each (n, v) G G. If the constraint for some {u, v) is 
not satisfied, for all i we set the i-th coordinate of the hyperplane a to be q, and the scalar 6 to be 1. This a, b 
pair satisfy the desired constraints, and are output by A. Note that A runs in poly(n) time for any k. □ 

B.2 /c-TC-Spanner Algorithm for Large k 

For large k, we have the following better approximation, which is specific to the k-JC Spanner problem. 

Theorem B.6. For any k > 6, there exists a deterministic approximation algorithm for the k-TC Spanner 
problem with approximation ratio 0{{n log n) / (fc^ + k log n)). 

Proof. Let G be the input digraph. Assume, w.l.o.g., that G is connected. We construct S, a /c-TC-spanner 
for the graph G such that \S\/Sk{G) < O {{nlog n) / {k'^ + k log n)). Set /c' = cA; for c to be determined. Let 
G' be G with each directed cycle contracted to a vertex, and let H = TR{G'). For each vertex v G V{H), 
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define a set S-i, of vertices such that: (i) 15^1 < 4n/k', (ii) for each u e S^, v u, and (ii) for any 
vertex w such that v w, there exists w' G Sv with dniw, w') < k' /A. One can easily see such a set 
exists by averaging and it can be efficiently constructed. Next, we define another set of vertices W C V{H) 
such that for every pair of vertices u and v such that u v, either there is a path of length at most 
3fc'/8 from u to w in if or there is a path from u to v that contains a vertex in W. The natural greedy 
algorithm for this problem constructs W to be of size at most To construct the fc-TC-spanner 

S, add to S the edges in H and for each vertex w G W, add edges from w to all vertices in Sw Also, 
for each contracted cycle in G, add an undirected star centered at one arbitrary vertex of the cycle. The 
size of S is at most \H\ + 0((n^ log n) / (/c^ + A; log n)) + 0{n). Since Sk{G) = ^{n) if G is connected, 
\S\/Sk{G) = 0((n log n)/(fc^ + fclogn)). To see that S* is a fc-TC-spanner, observe that for any pair of 
{u, v) with u ■~^G V, there will be a vertex w e W within distance 3k'/ 8 of u and a vertex w' G Syj within 
distance k' /A of v; so, if none of the involved vertices are cycles, the distance between u and in 5 is at 
most 3k' / 8 + k' /4 + 1 = 5k' /8 + 1. If the vertices u, v correspond to contracted cycles, it is easy to see 
that the path length will be at most bk' /8 + 5. We choose k' = ck to ensure 5k' /8 + 5 is at most k; this is 
always possible because k>Q. □ 

C 2 ^^g' ' "-Hardness of A;-TC-Spanner for constant k>2 

Proof of Theorem 3.2. We give a reduction from MIN-REP with unrestricted parameters, considered in [21]: 

Fact C.l ([21]). For all e G (0, 1), there is no polynomial time algorithm for the MIN-REP problem with 
approximation ratio 2i°g'"'" unless NP C DTIME{-nP''^^^°^''). 

We reduce an arbitrary MIN-REP instance on n'^' vertices to a MIN-REP instance on n vertices with 
parameters in the desired range (where n' is a suitably small constant). Since MIN-REP with unrestricted 
parameters is 2'°^ " "-inapproximable and the reduction is polynomial time, the theorem follows. The 
reduction consists of a sequence of five transformations on the original instance. We describe each of the 
transformations and specify how the parameters of the input and output MIN-REP instances are related. 

1 . ( Disjoint copies ) 

Given an (no,r-o,do)"^o)-MIN-REP instance Go with OPTq as the solution value, Ti{Go,n^^) is 
defined to be the MIN-REP instance Gi with n^^ disjoint copies of Gq. Gi is a (ni, ri, di, mi)- 
MIN-REP instance with m = n^^no, ri = n^^ro, di = do, and mi = mo. The solution value of Gi 
is OPTi = n^^OPTo because if OPTi < h^^OPTq, one could, by averaging over the n^^ copies of 
Go, extract a MIN-REP cover for Go of size smaller than OPT. 

2. (Dummy vertices inside clusters) 

Given an (ni, ri, di, mi)-MIN-REP instance Gi with OPTi as the solution value, T2{Gi,n^^) is 
defined to be the MIN-REP instance G2 obtained by increasing the size of each cluster by a factor of 
n^^ and not attaching any edges to the new vertices. G2 is a (^2, r2, d2, m2)-MIN-REP instance with 
n2 = n^^ni, r2 = r\, d2 = di, and m2 = n'^^mi. The solution value of G2 remains OPT2 = OPTi 
because the minimum cover of G2 does not include any isolated vertices. 

3. (Blowup inside clusters with matching supergraph) 

Given an (^2, r2, (i2, m2)-MIN-REP instance G2 with OPT2 as the solution value, T^{G2-,n^''^) is 
defined to be the MIN-REP instance G3 obtained as follows. For each cluster Ai in G2, construct 
a cluster A!^ in G3 consisting of n^^ copies of Ai. Let {A!j)k denote the A;th copy of Ai inside A!^. 
Whenever there is an edge in G2 between u ^ Ai and v G Bj, for each 1 < A; < n^^, add an edge 
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between the copy of u in {A'j)k and the copy of v in {Bj)^. This procedure yields a (ns, rs, ds, mz)- 
MIN-REP instance G3 where 77,3 = n^'^n2, = r2, = d2, and 7713 = m2. The solution value of 
G3 remains OPT^ = OPT2 because the supergraph corresponding to G3 and G2 are identical. 

4. (Blowup inside clusters with complete supergraph) 

Given an (713, r3, ^3, m3)-MlN-REP instance G3 with OPT-?, as the solution value, T\{G-?,.n^'^) is 
defined to be the MIN-REP instance G4 obtained as follows. For each cluster Ai in G3, construct 
a cluster A!^ in G4 consisting of n^* copies of Ai. Let {A[)k denote the fcth copy of Ai inside A!^. 
Whenever there is an edge in G3 between u ^ Ai and v E Bj, for each 1 < fci, A;2 < n^*, add an edge 
between the copy of u in {A'j)ki and the copy of v in {Bj)k2- This procedure yields a (n4, r4, ^4, m4)- 
MIN-REP instance G4 where 714 = n^*n-^, r4 = r3, ^4 = n^*d^, and 7714 = 7713. The solution value 
of Gi remains OPT^ = OPT3 because the supergraph corresponding to G3 and G4 are identical. 

5. (Tensoring) 

Given an (714, r4, (i4, 7714) -MIN-REP instance G4 with OPT4 as the solution value, T5(G4, tj,^"') is 
defined to be the MIN-REP instance G5 obtained by repeating the following construction logg n^^ 
times^. For each cluster Ai in G4, construct two clusters A'^ and A" in G5. Furthermore, A'^ contains 
two copies of Ai and A'l contains two copies of Ai. Denote the two copies inside A'^ as (^l^i and 
{A'j)2 and similarly the two copies inside A" as {A")i and {A'[)2- For each edge (n, v) in G4 with 
u G Ai and V G Bj, add the following four edges in G5: between the copy of u in (^'Ji ^^Py 
w in (^^1, between the copy of u in {A'j)2 and copy of v in {Bj)2, between the copy of u in (.4")i 
and copy of u in {Bj)i, and between the copy of u in {A'{)2 and copy of in {Bj)2. 

The procedure yields a (715, rs, ^5, r7T5)-MIN-REP instance G5 where ns = n^'^sn4, rs = n^^r4, 
4 = ^4, and ms = 7774. Also, we argue that OPT5 = n'^^-'0PT4. Clearly, OPT5 < n^^'>OPT4 
because one could choose copies of the vertices in the cover for G4 in each of the n^^ copies of the 
clusters of G4. For the other direction, notice that G5 contains n^^^ vertex disjoint copies of G4, and 
so, if OPT5 < n?^^OPT4, then by averaging, there would be a copy of G4 covered using less than 
OPT vertices, a contradiction. 

For some positive k' sufficiently smaller than k, consider an arbitrary (n** , ro, do, mo)-MIN-REP in- 
stance Go with optimum OPTq, where the only constraints on the parameters are nontriviality conditions: 

ro G [1,77'^'], do G [1,77'''], mo G [1,77"'], and OPTq G [1,277''']. Let G = r5(r4(r3(r2(ri(Go, 77^^), 77^^), 

77^=^), 77^"'), 77'^^). We choose 61,62,63,64,65 such that G is a (77, r, d, 7n,)-MIN-REP instance with r G 
[77^,77^+'"'], d G [77^,77^+"'], 777 G [77^,77^+'''] and OPT G [77^,77^+''']. By definitions of transfor- 
mations, 77 = n«'+'5i+52+<53+<54+255^ ^ g [77^i+'55,77,'''+'^i+'55], d G [77^Sr7^+K']^ g [n^^ , n'^' +^-^], and 
OPT G [„'5i+2<55^„«'+5i+2<55] Therefore, choose 64 = D, 62 = M, 65 = F - R, and 61 = 2R - F. 
All of these values are in (0, 1) by restriction of the parameters in the theorem statement. Now, since 
k' -I- (5i -I- (52 -I- (53 -I- (^4 -I- 2(55 = £> + M-|-F-|-(53-|-/«' and since D + M + F < 1 and k' can be made as small 
as we want, we can choose ^3 G (0, 1) such that 77 = n""' +^^+^2+63+64+265 _ Therefore, G is a MIN-REP 
instance with parameters in the desired range. □ 

Attaching butterflies in the construction of a hard A;-TC-Spanner instance G- We further discuss 
the way the butterflies are attached to the groups. Recall that Xq is a (r7o, r, (io, ?77)-MIN-REP instance with 
770 = 77^,r G [77''/^, 77''/^+''], do G [n^,nP~^'^] and 777 G [77^'', 77^''+'']. Thus, for each group Aij there are 
at most ^ non-isolated vertices. We will attach the butterfly BF{Aij) in such a way that each vertex in 
BF''~^{Aij) is adjacent to at most ^ non-isolated vertices in Aij, out of a total out-degree of size d*. This 

^For simplicity, we assume n^^ is a power of 2. 
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is the crucial property exploited later in the proof. We can achieve this property in the following way. Recall 
that each vertex of BF^{Aij) is labeled (ai, . . . , ak-i, s), where ai G [d*] for all I £ [k — 1], s £ [k], and 
each vertex v = (ai, . . . , ak-i, k) connects to v' = (ai, . . . , afc_2, o'^-i, k — 1). Thus, for a fixed prefix 
h= (61, 62, ■ • • , hk-2) all vertices (61, . . . , 6fc-2, ^'fe-i, k — 1) connect to the same set Ai, of vertices in Ai^j, 
and \Ai,\ = d^. Choose the set A^, to contain at most d^/m non-isolated vertices, which is possible since the 
total fraction of non-isolated vertices in ylj , is < — . 

Lemma C.2 (Rep-cover Spanner Lemma). There is ak-TC-spannerHs.t. \H\ = 0{OPT n^-\'^)'^), 
where OPT is the minimum rep-cover of the underlying Iq (and of I as well) . Moreover, H contains only 
paths of type {2k{k - 2),2k{k + 1)). 



Proof. We construct the graph H. by adding some shortcut edges to Q. Let Sq be a minimum rep-cover of 



Xq of size OPT. Recall that each Ai and Bj is replicated n times in I. Let S be the set of all replicas 

5 1 

in I of vertices in Sq. Let Ai^ and B^ i be two comparable groups of vertices. Recall that = (^) '^-i . 
To get a fc-TC-spanner on BF{Aij) U BR{Bk^i) connect each vertex v from the restriction of S to Ai^j 
with all its comparable vertices in BF^~'^{Aij). Similarly, connect each vertex in the restriction of S 
to Bk^i to its d1 comparable vertices in BR^^^{Bk,i). Since every vertex u G BF^{Ai^j) is comparable 
to every vertex v G Ai^j, it follows that there is a vertex w G BF^^'^{Aij) comparable to both u and v. 
Thus, between any such u and v there is a path using an edge of type 2&i.{k — 2). Similarly, every vertex in 
BR^^^{Bk^l) is comparable to every vertex of S^,;. By our construction, any pair of vertices (ui, nfc+3) G 
BF'^{A,_j) X BR'^+^{Bkj) isconnectedby apath'of type (2&(A; - 2), 2&(A: + 1)). In addition, any pair of 
vertices {ui,Uk+2) G BF'^Ai^j) x BR^+'^{Bk,i), as well as (^2,^^+3) G BF^{Ai^j) x BR^+^{Bk,i) are 
connected by a path of length at most k using shortcut edges of types 2k,{k — 2) and 2&(A; + 1), respectively. 
By connecting all the comparable groups Ai^j and B^^i in this manner, we obtain a A;-TC-spanner on Q. 
Since there are •n}~^ copies of each j and B^ i the total number of shortcut edges added is OPT-n}~^dl 

OPT 77/-"^ (^)^^ and we only used shortcut edges of types 2Sz{k — 2) and 2Sz{k + 1). In addition, since 
Q is transitively reduced, H must include all the edges of Q. We bound the size of Q by inspecting the total 
number of edges in the butterflies (knd^), the MIN-REP instance (< nd), and the brooms (nd* + n^~^rdl). 

Thus, \g\<kr n^-^ (^)^^^ + n^+''+''-^ + n (^)s^ + n^-^ r (^^)^. The following condi- 
tions, satisfied by the parameters of our construction, suffice to show that each term of the preceding sum is 
respectively o{\7{\). (The parameter k is omitted from the conditions, since if the inequahties are satisfied 
without K then k can be made sufficiently small to ensure that they are satisfied with k.) 

A 2 / k 5\ ^ ^2k-3 

<+(!-«)+ — (*-5j>2+(l-«) + ^(«-5j ,orC>«5^ (6) 

□ 

Lemma C.3 (Path Analysis Lemma). There are o{OPT) deletable superedges {Ai, Bf), where i, j G [r]^. 

Proof We call a path canonical if it contains shortcut edges of types both 2&;(A; — 2) and 2&(/c + 1); 
otherwise, a path is alternative. Observe that any alternative path contains at least one shortcut edge 
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from among the following three cases: (1) shortcut edges crossing both 14 and V^+i, i.e. one of the 
shortcut edge types: 3Sz{k — 2), 3Sz{k — 1), 2Sz{k — 1), 2Szk, and 3&/c; (2) shortcut edges of type 
3Sz£ where £ < — 3; (3) shortcut edges of type 2Sz£ where £ < k — 3. Let Sb be the set of all 
the shortcut edge types contained in the above three cases. Then \Sb\ = &{k). Now, for each short- 
cut edge type S G Sb, let Del{S) = G [r]^| at least fraction of paks {u,v) G BF^{Ai) x 
BR^^^{Bj) have an alternative path containing a shortcut edge of type 5} . By a union bound, the total num- 
ber of deletable superedges is at most '^s&Sb ^^K^)- Hence it suffices to show that for all S G Sb, 
Del{S)=o{OPT). 

Let C(5) = {{u,v) G BF^{Ai) x BR^'^^{Bj) \ 3 an alternative path between n and w containing 
a shortcut edge of type S}. By the definition of Del{S), since for all i G [r], \BF^{Ai)\ = ^ and 
\BR''+^{Bi)\ = n^-^dl we have 

\C{S)\>\Del{S)\-^-n'-'dl (10) 

Now we will obtain upper bounds of |C(S')| in terms of OPT for each of three cases of shortcut 
edges, thus obtaining upper bounds on Del{S). Recall that 6 = ^— t, V = 2{4k~A){Ak-2) ' ^^'^ ^ ~ 

5 (§Ef + 4^). Also, recall r G [ni,ni+T, d G [n^^-^'^+'^ ,n^^-^^+''+% and m G [n^^n^^+^^l for 

/ axl/(fe-l) 

some small enough constant k and = ( ^ 1 . We mostly ignore k below since we can make it as 

small a constant as we like. 

Suppose that 5 is a shortcut edge from the first case. Then 5 is a shortcut of type £iSz{k — £2), where 
2 < ^1 < 3,0 < £2, and £\ — £2 > 1- Now we obtain that for any shortcut of type S, the shortcut can 
be used for at most d^-^-^^ ^2+^2-^1 = ^^+2-^1 many pairs {u,v) G C{S). Hence, \C{S)\ < d^+^-^i . 

OPT"^^ < dl ■ OPT"^^^. From (10), we obtain that 

Then because 5 < 1 — (5 > 2(F^' ^"^^ ^® obtain that n^"'' is a polynomial factor larger than 
= f ^ j , which proves that |i:>e/(S')| = o{OPT). 

Now suppose that S is a shortcut type of the second case. Let 5 be type 3&£, where 1 < £ < k — 3. 
Now, from the fact that out-degree of each vertex in is at most n^-^+'n+K'^ ^ve obtain that for any shortcut 
of type S, the shortcut can be used for at most d^~^ d^~'^'^n''^~^^^'^ dl = d^~'^rS^~^^^'^d^ many pairs 
(u, v) G C{S) (ignoring k as mentioned above). Hence, upto small polynomial factors, 

\C{S)\ < dt-^^^-^^+^'dl ■ OPt"-^ = ^ OPT (11) 

logn logn 

From (10) and (11), we obtain \Del{S)\ < MSB\a:f^OPT. Now, rj < and so, \Del{S)\ = 

o{OPT). 

Now suppose that 5 is a shortcut type of the third case. Let 5 be type 2Szi, where i < k — 3. Note 
that for any vertex v in Vfe_i the number of non-isolated vertices in Vj- that v is connected to is ^. Hence, 
together with the fact that out-degree of each vertex in Vk is at most n^~^'^^'^'^, we obtain that for any 

shortcut of type 5", the shortcut can be used for at most -^-^nS many pairs (u, v) in C{S) (upto 

small polynomial factors). Then 

\C{S)\ < %-n('-'^+^dl ■ OPt""-^ = ^ OPT. (12) 
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From (10), (12), and the fact that = o(n2'?logn), v/e get \Del{S)\ < A\Sb\^;2^^0PT = o{OPT). 

ogn ^ 

Lemma C.4 (Rerandomization Lemma). If a '^-good k-TC-spanner IC' for Q' is given, then there exists 
K,", a 1-good k-TC-spanner for Q', such that |/C"| < 0(|/C'| • logn). 

Proof First, we fix some notation. Consider some (i, j) G [r]^ such that there is an edge between a vertex 
in Ai and a vertex in Bj in Q' . Let Si^j be the set of vertices in Ai that are adjacent to Bj, and let Tj j be 
the set of vertices in Bj that are adjacent to Ai. We know that at least | of the vertices in BF^{Ai) have a 
path of type {2k,{k — 2)) to Sij and at least | of the vertices in BR^^'^{Bj) have a path of length 1 from 
Tij. By a Markov argument, for at least ^ of the groups Ai^s in Au at least \ of the vertices in BF^{Ai^s) 
must have a path of type (2&(/c — 2)) to S'jj. Call the butterfly attached to such a group Ai^g an {i,j)-good 
butterfly, and call the set of vertices in BF^~'^{Ai^s) that have shortcut edges to Sij {i, j)-helpful vertices. 
Similarly, for at least | of the groups Bj^t, at least ^ of the vertices in BR''~^'^{Bj^t) have shortcut edges to 
Tij. We call the brooms attached to such groups Bj^t {hj)-80od brooms and we again call the vertices in 
BR^+'^^Bj^t) that have shortcut edges to Tj j (i, j)-helpful vertices. It will be clear from context whether a 
helpful vertex is to the left or right of the MIN-REP instance. 

Our construction of /C" ensures that in /C", for any two comparable clusters {Ai,Bj), each vertex in 
BF^{Ai) is comparable to a helpful vertex in BF''~'^{Ai) and each vertex in BR^^^{Bj) is a helpful 
vertex. This is enough to ensure that K," is 1-good fc-TC-spanner for Q' . We will construct K," to be equal 
to |J^i\°^"'' nr(/C') where each 11^ is a random transformation of /C' that moves the shortcut edges. 

Each IIj. will be the composition of several transformations on the edges of JC'. The transformations 
move only shortcut edges, but not transitive reduction edges, in /C'. Informally, the first transformation ran- 
domly permutes the groups in each cluster on the left side of the MIN-REP instance, the second randomly 
permutes the groups in each cluster of the right side of the MIN-REP instance, the third randomly permutes 
the edges of the butterfly graph, and the fourth randomly permutes the broomsticks. Formally: 

• Left Group permutations: II'^ 

For each i € [r], independently choose a random permutation vTj : [n^^''] [n^^^]. For each cluster 
Ai, if (u, v) is an edge in K,' with u, u G BF{Ai^s), then there is an edge («', v') in 11^^ (/C'), where 
u' and v' are the copies of u and v respectively in 5F(Aj^^(s)). 

• Right Group permutations: 

For each J G [r] , independently choose a random permutation tTj : [ra^"''] [n^~^]. For each cluster 
Bj, if {u, v) is an edge in /C' with n, -y G BR{Bj gi), then there is an edge v') in W3{K,'), where 
v! and v' are the copies of u and v respectively in BR{Bj,,^.(^gi^). 

• Butterfly permutations: li'^^ 

For each i G [r] and s G [n-'^"''], label a vertex u in BF{Ai^s) as (ai, 02, • . . , ak-i,m) G [ci*]'^'^"'^ x 
[k], where u G and (ai, 02, . . . , ak-i) is the usual vertex labelling that defines a generalized 
butterfly graph. Now, for every (i, s) and every (ai, . . . , 0^-3) G [d*]*^"^, independently choose 
two random permutations ^("■^-'■■■'"■''-•^^ . ^^^j ^ ^^-j^j ^j^jj ^(aiv.ifc-s) . _^ p^^. 

edge {u,w) e BF''^'^{Ai^,,) x 5F*=(Aj,s) where u = {ai, . . . ,ak-3,ak-2,ak-i,k - 2) and u: = 
(ai, . . . , afc_3, a'^_2, Q-fc-i) there exists the edge {u', w) in n^'^(/C') where n' = (ai, . . . , ak-3, 

ttJ"/'-'"*-' Vfe-2), aj"'"'""'"' Vfc-i)> ^ - 2). AH other edges in the butterfly stay fixed. 

• Broom permutations: 
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For each j G [r] and s' G [n^ independently choose a random permutation vr^ g/ : [p] — [p] 
and (T^' s' : [^] [t]- Label a vertex v G BR^^'^{Bj^s') as an element of [p] and label a vertex G 
BR^+-\Bj^s') as an element of [p] x [t] in the natural way. If (n, ij;) G BR'^+^Bj^^') x BR^+'^^iBj^s') 
is an edge in /C', then {u, w') G BR^+^{Bj^s') x BR^+^{Bj^s') is an edge in Jl^{K.'), where k;'' = 
(7rj^s/(u;i), crj^5/(u)2)) if the label of w is (lui, ■u;2). All other edges in the broom stay fixed. 

Now, for each r = 1, . . . , O(logn), define 11^ to be the composition of n'f, Il'f, II^-'^, and II'"'. For 
each r, choose all the permutations independently. As we said before, we set K" = \Jr^r{^')- 

Claim C.5. For each G [r]^ such that Ai and Bj are comparable, for any u G BF^{Ai) and v G 
BR''+^{Bj), 

Pv[u is in a {i,j)-good butterfly in Ilr{IC')] > -, Pv[v is in a {i,j)-good broom inIlr{IC')] > - 

Proof. At least half the butterflies attached to Ai are good from above, and hence, for every vertex u G 
BF^{Ai), the left group permutations ensure that with probability at least ^, the edges of a good butterfly 
are mapped to the butterfly that u belongs to. The right group permutations provide the same function for a 

V G BR^+^{Bj). □ 

Claim C.6. For each {i, j) G [r]^ such that Ai and Bj are comparable, then for any v G BR^^^{Bj): 

Pr[f is a (i, j)-helpful vertex \v is in a (i, j)-good broom] > - 

Proof. At least half of the broomsticks of a good broom are helpful (i.e., incident to a shortcut edge), and 
hence for every vertex v G BR^^^{Bj), the broom permutations ensure that with probabihty at least |, t; is 
incident to a shortcut edge. □ 

Claim C.7. For each {i, j) G [r]^ such that Ai and Bj are comparable, then for any u G BR^{Ai): 

Pt[u is comparable to a {i, j)-helpful vertex \u is in a (i, j)-good butterfly] > - 

Proof Suppose BF{Ai^s) is a (z, j)-good butterfly. For any (afc_2, afe_i) G [d*]^, let S(^ak-2,ak-i) ^ th^ set 
of vertices u in BF^{Ai,s) such that u is labelled as (04, ... , ak-3,ak~2, Ofc-i, 1) where (ai, . . . , afe-3) are 
arbitrary elements of [d*]*^^'^. Note that all the vertices in a given S(^ak-2,ak-i) comparable to the same 
set of vertices in BF^~^{Ai^s) and hence, either they are all comparable to an (i, j)-helpful vertex or none 
of them are. Hence, at least ^ of the S(^ak-2,ak-i)'^ must have every vertex comparable to a helpful vertex 
in BF''~^{Ai^s)- Now, because of the random butterfly permutations, a given vertex v G BF^{Ai^s) falls in 
such a S(^a.k-2,ak-i) probability at least □ 

Thus, for two vertices u and v in comparable Ai and Bj respectively, the probability that u and v 
are connected by a canonical path in IIr{}C') is at least ^. Since we take O(logn) independent random 
transformations 11^, the probability that u and v will be connected by a canonical path in at least one Ilr{}C') 
is at least 1 — po\y{n) ■ Taking a union bound over all vertex pairs in Ai and Bj as well as all possible i and 
j, we find that with probability at least ^, K." has a canonical path between any comparable u e Vi and 

V G Therefore, the desired K." exists and is of size at most 0(|/C'| • log n). □ 

Lemma C.8 (Rep-cover Extraction Lemma). Given K,", a 1-good k-TC-spanner for Q', of size o{OPT ■ 
n^~^ ■ dl), there exists a MIN-REP cover of I of size o{OPT). 
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Proof. For s G define /C^' to be the subgraph of K." induced by U^^^ {BF{Ai^s) U BR{Bi^s)) ■ The 

K,'g are clearly disjoint. By averaging, there exists an s such that j/Cj | < o{OPT ■ d^). 

We further partition the shortcut edges in K,'^ into dl parts. For each x, y G [d*], let ?7a;,y denote the set 
of all the nodes in Lil^iBF^{Ai^s) with butterfly coordinates (ui, . . . , Uk-2,x, y, 1), where ui, . . . , «fe-2 G 
[d*]. To partition the corresponding broomsticks, identify the nodes in BR^~^'^{Bi^s) with [d^^], and for each 
such node x G [d^], identify its descendants in BR^^^{Bi^s) with (z, 1), . . . , (z, d*). For each y G [d*], 
let U'^^y denote the set of all the broomsticks \Jl^'^BR^^^{Bi^s) with coordinates (x, y). Define K''^^x,y ^e 
the subgraph of /C" induced by the nodes comparable to the nodes in Ux^y U U'^ y. 

Observe that the shortcut edges in different /Cg'^ ^ are disjoint because (a) different U'^ y are disjoint and 
(b) the descendants in Vk-2 of different Ux,y are also disjoint. Thus, by averaging, there exist x, y such that 
^'i,x,y contains o{OPT) shortcut edges. 

Let S be the set of vertices in Vk and V^+i that are incident to shortcut edges in JC§^x,y- Then |5| < 
o(pPT). Observe that S* is a rep-cover for the MIN-REP instance X'^ obtained by restricting J' to the edges 
between Ai^s and Bj^g. This holds because in /C", each comparable pair of nodes in Ux^y x U~^y is connected 
by a canonical path. But a MIN-REP cover for Z'g is also a a MIN-REP cover for I' by definition of X. 
Finally, given a rep-cover S of I', we can get a rep-cover of X by adding at most 2 vertices per super-edge 
deleted from X to obtain I'. Since o{OPT) super-edges were deleted and since l^l < o{OPT), we obtain 
a MIN-REP cover for X of size o{OPT) . □ 

D 0(log n)-Hardness of 2-TC-Spanner 

Theorem D.l. For any k > 2, it is NP-hard to approximate the size of the sparsest k-TC-spanner within a 
ratio ofO{^ log n). In particular, 2-TC-Spanner is ri(log n)-inapproximable. 

Our proof uses a reduction from a variant of Set Cover, called (a, 6, c)-Nice Set Cover. Before 
defining this problem we define other variants of Set Cover. An instance of (a, 6)-Set Cover, consists of a 
bipartite graph G = A\J B, with = a and \B\ = 6. An instance of a-Balanced Set Cover consists of 
a bipartite graph G = A\J B, with |^| = \B\ = a. An instance of (a, c)-Balanced Bounded Set Cover 
consists of a bipartite graph G = A\J B, with |^| = \B\ = a and such that the degrees of the vertices in A 
are at most c. Finally, an instance of (a, 6, c)-Nice Set Cover consists of a bipartite graph G = A[JB, with 
1^1 = a, \B\ = b. B can be partitioned into disjoint sets Bi such that B = (J^^^Bi, \Bi\ = |, assuming ^ 
is an integer. G must satisfy the property that if G ^ is adjacent to w G Bi, for some 1 < i < a, then v 
is adjacent to every element of Bi. Moreover, v is adjacent to at most c sets Bi. A solution to all these Set 
Cover variants is a minimum number of vertices in A that cover all the vertices in B. 

Lemma D.2. It is NF-hard to approximate a solution to (n", n^, rf)-NiCB Set Cover to within a ratio of 

7 a clog nfor some constant 7, where < c < a < b. 

Proof. We will need the following fact, proved in [43]. Earlier, this result was shown under the weaker 
assumption that iVP g DTIME{n^^^°^^°s''^) [26, 38]. 

Fact D.3. There is ad> Ofor which it is NP-hard to approximate a solution to {n'^, n)-Set Cover to within 
a ratio ofj log n, for some 7 > 0. 

Claim D.4. It is NP-hard to approximate a solution to n- Balanced Set Cover to within a ratio of^ log n, 
for the same 7 as above. 

Proof. By Fact D.3, (n'^, n)-Set Cover is not approximable within a factor of 7 log n, unless P = NP. 

Using a reduction from {n'^, n)-Set Cover, if |yl| = n'^ < n, transform this instance into an instance where 
1^41 = \B\ by padding A with dummy vertices. If \A\ > n, transform this instance into an instance where 
1^1 = |-B| by padding the set B with dummy vertices and connecting them to all vertices in A. □ 
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Applying Lemma 2.3 of [36] to an instance of n'^-Balanced Set Cover, and using Claim D.4, we obtain 
the following. 

Claim D.5. It is 'HV-hard to approximate a solution to (n", rf)-Balanced Bounded Set Cover to within 
a ratio of ^ a c log n, where 7 is from Claim D.3 above. 

To complete the proof of the lemma, notice that a set M is a solution to an instance of (n", n^, n'^)- 
Nice Set Cover iff M is a solution to the instance of (n", n'')-Balanced Bounded Set Cover, resulted 
from compressing each set Bi into a single vertex bi, 1 < i < n"'. By Claim D.5 above, it follows that 
(n", n^, n'^)-Nice Set Cover is not approximable within a ratio of 7 a c log n, unless P=NP. □ 

We now prove the main theorem of this section. 

Proof of Theorem D.l. Let a = 1 + ^ and /3 = g^. Given Gi = Vk+i U Vfc+2, an instance of (n, n", n^)- 
Nice Set Cover, transform it into the following k + 2-partite graph G = Fi U ^2 U . . . V^+i U Vk+2, with 
edges directed from Vi to V^+i. Let \ Vi\ = n, 1 < i < k + 1, and Vk+2 = The induced subgraph on 
F1UV2U. .. Vfe+i is n), the butterfly graph of diameter and width n. Then|G| < /cn^'^+fc +n"+'^ = 

9 (n^+iofe) edges. Notice that there are indeed at most n""*"^ edges from Vk+i to Vk+2 since there are n 
vertices in Vk+i, each of degree at most n^'^°'~^. 

Lemma D.6. OPTs = e{OPTNSc ni). 

2 

Proof. First, we show that there is a fc-TC-spanner H of G s.t. \H\ = Q{OPTnsc n-k) edges. Then we 

2 

show that any fc-TC-spanner of G must have ^1{0PTnsc " ) edges. 

Notice that the only pairs of vertices of G that are not already at distance at most k are the comparable 
vertices u, v, with u e Vi and v G In order to connect such pairs by a directed path of length at most 
k, we need "shortcut" edges between different levels Vi and Vj,i + 2 < j. W.l.o.g., we may assume that the 
only shortcut edges used are those connecting vertices in Vi to Vi+2, for some f's. Indeed, a shortcut edge 
connecting a vertex u G to a vertex v e Vj, where j > i + 2 can be replaced with one edge connecting 
n G Vi to a vertex w G Vi+2 that is an ancestor of ti. In this way, all paths from Vi to Vk+2 that previously 
had a path of length at most k still have a path of length at most k. Define an edge e = (u, v) to be a type i 
edge if u ^ Vi and v G Vi+2. We next build a fc-TC-spanner of G with Q{OPTisfsc nk) edges. Let H be 
the smallest /c-TC-spanner of G which only uses shortcut edges of type fc — 1. 

Claim D.7. \H\ = Q{nl OPTnsc) 

Proof. Let O be a set of vertices in Vk+i that is an optimal solution to the (n, n°, n^)-Nice Set Cover 
instance. Connect each vertex t; G O to the set of all the n^/*^ ancestors of v from level Vk-i. Direct 

2 

these edges from Ay to v. Notice that we added OPT^scn-k edges and the new graph H' is a fc-TC- 
spanner. Indeed, each vertex u G Fi is comparable to each vertex v E O, and thus, there is a vertex w e Ay 
that is comparable to u. This imphes that for every u e Vi there is a path of length A; — 1 to each of the 

vertices of O, resulting in a path of length k to each vertex in Vk+2- To show that H (the minimum size 

2 

/c-TC-spanner with shortcuts only of type A; — 1) needs at least OPT^sc'iT'^ edges on top of those in G, 
assume otherwise. For v G Vk-i, let n{v) be the number of type k — 1 edges leaving from v. By assumption, 
"^vev _i ^(^) < OPTnsc nk- Each vertex in w G Vk-i has exactly a{v) = n k ancestors in Vi. For 
u G Vi, let e{u) be the total number of type A; — 1 shortcuts leaving from its descendants in Vk-i. Since 
there exists a path of length k from u to each vertex in Vk+2, it follows that e{u) > OPTICS- Notice that 

2 

T.veVk-i n{v)a{v) = EneVi ^(^) - OPTncs n. This implies that Y.veVk-i ^^^) - OPTICS nk, a 
contradiction to our assumption, concluding that \H\ = \G\ + OPT^sc nk. Next we show that \H\ = 
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0(nfe OPTnsc)- Indeed, OPTj^sc is, by construction, the same as the size of the optimal solution to 
an (n, n'^)-Balancecl Bounded Set Cover instance, where we must cover n vertices on the right with n 
vertices of degree at most on the left. This implies that OPT^sc > n^^^ = n^~5k. Now, \G\ = 

1 I _17_ 2 2 2,-. J_ I I _L8_ 

Q (n^iofe) and \H\ = \G\ + OPT^sc n^. Since OPT^scnk > sfc = n^iofc, this implies that 

\H\ = Q{OPTNscni). □ 

Let M be a sparsest spanner on G which possibly uses shortcut edges of types other than k — 1. Assume 
for the sake of contradiction that |M| < \nk OPT^sc- A vertex u e Vi can reach v G Vk+2 in at 
most k steps by using shortcut edges either of type z < A; — 1 or of type k. We will show that, under our 
assumption, there are many vertices in Vi that can reach at most ^ OPT^sc vertices in Vk+i by using 
only edges of some types i < k. Moreover, there are many vertices in Vi that reach only QPT^sc 
edges of type k. That will be enough to argue that a contradiction must occur, allowing us to conclude that 
\M\ = Q{nl OPTmsc). 

Claim D.8. Let R be the set of vertices in Vi that can reach less than ^ OPTj^sc vertices v G Vk+i in at 
most k — 1 steps in M. Then \R\ > ^. 

Proof. For each vertex u e Vi and v G Vk+i, define an indicator variable which is 1 iff there is a 
shortcut edge along the unique path from n to u in G. Consider a type i shortcut edge e = {vi, Vi+2), with 
Vi E Vi and Vi+2 € 1^+2- Then there are vertices u in Vi with u < Vi . Moreover, there are n s 
vertices v G Vfc+i with ;',+2 < v. Thus, this shortcut edge e can set at most n ^ ^ k = n k different 
Xu^v to 1. By assumption, there are less than ^ nkOPTj^sc shortcut edges of types i, where i < k — 1. \t 
follows that less than \ n OPT^sc different X^^^'s can be set to 1. For u G Vi, let n{u) be the number of 
vertices v G V^+i that u can reach in less than k steps. Thus, E„gvi < \ OPT^sc- By Markov's 

inequahty, Pr„gyjn(ti) > ^ OPTjqsc] < 2- This implies that more than ^ of the vertices u e Vi can 
reach less than ^ OPT^sc vertices v G F^+i in less than k steps. Therefore, > ^. □ 

We say that a vertex u reaches an edge e = {v, w), if there is a path from u to v. For u G Vi let t{u) be 
the number of type k edges that u reaches in M. 

Claim D.9. Let S be the set of vertices n G Fi s.t. t{u) < I OPTnsc- Then \S\ > |. 

Proof. Assuming \M\ < ^ nk OPTnsc^ there are at most | nfc OPTnsc edges of type k. Each u G 
has exactly ancestors in Vi, and therefore ^ t{u) < n}~^ ^ ni OPT^sc = \n^^^OPTNsc- 

Thus, EugvJ t(tt)] < \nkOPTNsc and by Markov's inequality, PruevAK'^) < \nkOPTMTs\ > \- □ 

Let T = R n S. The two claims above imply |r| > 1. Now we argue that a vertex i; G T cannot 
reach some vertices in Vfc+2. Recall that an instance of (n, n", n'^)-Nice Set Cover was obtained from 
an instance of (n, n^)- Balanced Bounded Set Cover, by copying each vertex on the right times, 
which means that the optimal solution to one of them is also an optimal solution to the other. Suppose we 
remove ^ nkOPT]\fsc vertices from Vfc+2- This corresponds to removing at most ^nk^^~°'OPT]\fsc = 
i n'"^ OPTnsc = OPTnsc vertices from the universe of the related (n, n'^)- Balanced Bounded 
Set Cover instance. Let OPTbsc be the size of a solution to this new Set Cover problem. Then 

OPTbsc > (1 - OPTnsc 

Suppose then that v e T could cover all of the elements in Vk+2 ■ Each such vertex u G T can cover 
vertices in Vfc+2 in exactly two ways: (1) from the ^OPTnsc vertices it reaches in Vk+i via paths of length 
< k using type i < k edges, and (2) by at most ^nk OPTnsc type k edges it can reach. Thus we must have 
OPTbsc < \ OPTnsc, which is a contradiction since OPTbsc > (1 - o{l)) OPTnsc- Thus, v eT 
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cannot reach all of Vk+2, and so the optimal fc-TC-spanner on G must have size at least nkOPT^sc We 
can them conclude that |M| = Q{ni OPTnsc)- □ 

Suppose now that we could approximate the size of the sparsest /c-TC-spanner within 71 log n for some 

71 > 0. Then, since \M\ = ©(nkOPT^sc)^ we could approximate a solution to (n, n",n^)-Nice Set 
Cover within 72 log n, for some 72 > 0. By Lemma D.2 above, (n, n", n'^)-Nice Set Cover cannot 
be approximated within 7/3 log n = 0(^)logn, unless P=NP. Therefore, the size of the sparsest fc-TC- 
spanner cannot be approximated within a factor 73 ^ log n, for some 73 > 0, unless P=NP. □ 

E Constructing Sparse /c-TC-Spanners for Path Separable Graphs 

Definition E.l ([1]). Let G be a connected undirected graph with n vertices. G is {s, m)-path separable 
(for m > n/2) if for any rooted spanning tree TofG either (1) there exists a set P of at most s monotone 
paths^ in T so that each connected component of G\P is of size at most m, or (2) for some s' < s, there 
exists a set P of s' monotone paths in T so that the largest connected component ofG\P is (s — s', m)-path 
separable. G is said to be s-path separable ifG is {s, n/2)-path separable. If G is s-path separable, let S 
be the union of at most s paths in G such that each connected component ofG\S is of size at most n/2. S is 
called the s-path separator ofG. A digraph G' is called an s-path separable digraph if the undirected graph 
underlying TR{G') is s-path separable. 

In the above definition, the number of vertices in the path separator is left unspecified. Trees are 1- 
path separable, since S can be taken to be the centroid. Similarly, graphs of treewidth ttj are (itj + l)-path 
separable. Thorup [50] showed that every planar graph is 3-path separable. Indeed, in the case of any planar 
graph G and any rooted spanning tree for it, Thorup proved that there exists a set of 3 root paths of the 
tree whose removal disconnects the graph into components of size at most n/2. Abraham and Gavoille [1] 
studied the more general case of iJ-minor-free graphs and proved the following. 

Theorem E.1 (Theorem 1 of [1]). Every H -minor-free'^ graph is s-path separable, for s = s{H), and an 
s-path separator can be computed in polynomial time. 

The definition of path separability used by Abraham and Gavoille is shghtly different from Definition E.l. 
However, the separators produced in their proof of the theorem satisfy our notion of path separability [29]. 
Our main theorem in this section is the following: 

Theorem E.2. If G is a graph drawn from a minor-closed graph family that is s-path separable, for s = 
0(1), then G has a 2-TC-spanner of size 0(n log^ n) and, more generally, a k-TC-spanner of size 0{n ■ 
logn • Afc(n)) where Afc(-) is the k-row inverse Ackermann Junction. 

Since the families of bounded treewidth, planar graphs, and if-minor-free graphs (where H is a fixed 
minor) satisfy the hypotheses of the theorem, these families have 2-TC-spanners of size 0(nlog^ n). 

Proof of Theorem E.l. First we describe a preprocessing step resembling [50] in which the digraph is di- 
vided into subgraphs so that constructing TC-spanners for each subgraph individually results in a TC- 
spanner for the entire graph. Then, we show how to efficiently construct sparse 2-TC-spanners for each 

of these path separable subgraphs. Lastly, we give the construction for general k. 

Preprocessing Step. Let G be a transitively reduced digraph. Choose an arbitrary vertex r € V{G). Let 
Lq be the set containing r and all vertices reachable from r by a directed path. For i > 1, let L2i = {u G 

'a monotone path in a rooted tree is a subpatii of a path with one endpoint at the root. 

'^A graph is called i?-minor-fTee if it belongs to a minor-closed graph family that excludes H. 
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G\ U^tg^ Lj : 3u G ^f=QLj s.t. u v) and L2i_i = {t; G G\ U^i"o^ : 3« G U^l^^Lj s.t. v ^ u}. 
Then Lq,Li,. . . ,Lt partition the vertices in G, for some integer t < n. Evidently, 

Claim E.3. For any vertices u,v E G, ifu v and ifu G Li and v G Lj, then \i — j\ < 1. 

def 

For 1 < i < t, let Gi = U Lj. By claim E.3, any two vertices with a dipath between them must 
both be contained in some Gi. Moreover, any dipath between them must lie entirely in Gj. Therefore, a 
fe-TC-spanner for G is the union of /c-TC-spanners for each Gi. Notice that J2i ^ 

We next construct a spanning tree Tq for the undirected graph underlying G that is rooted at r and has 
the following property: for any undirected path in Tq from the root, the restriction of the path to a single 
level Li consists of a single directed path. 

Tg can be constructed inductively. First, since by definition r reaches all the vertices in Lq, a spanning 
tree of Lq rooted at r can be constructed with all edges oriented away from r. Now suppose we have a tree 
Tj-i that is rooted at r, spans all vertices in U^^gLj, and whose restriction to each level 0, . . . — l consists 
of a single directed path. If i is odd, Tj_i can be extended to a tree Tj where all the new edges are oriented 
towards U*~QLj. The case when i is even is symmetric. Our desired spanning tree Tq is Tj. The following 
lemma is immediate by the construction. 

Lemma E.4. A monotone path in Tq restricted to Gi, for any i G [t], is a concatenation of<2 dipaths. 

We assume G is transitively reduced and connected. If G is not connected, we can apply our algorithm 
on each component. We describe how to construct H, a 2-TC-spanner for G of size 0{nlog^ n). 
The recursive graph fragmentation. First, we apply the preprocessing step described above to G° = G; 
that is, we obtain a spanning tree Tqo and a collection of subgraphs, Gj, • • •• definition of path 
separability, there exists a set of monotone paths on Tqo such that one of two situations happens: (1) 
all the connected components in G°\P° are of size at most n/2, (2) the largest component of G'^\P'^ is 
of size greater than n/2 and is path separable. Let G^ denote the induced subgraph of G^ on the largest 
component of G°\P°. We can apply the preprocessing to G^ to obtain a collection of subgraphs G\,GI, . . . 
and a spanning tree Tqi rooted at some arbitrary vertex in G^. Again, we find an appropriate set of paths 

in Tqi and we recurse if necessary on the largest component of G^\P^. The recursion ends when 
the graph has been disconnected into components of size at most n/2. Notice that the total number of 
paths in P° U U • • • is at most s = 6(1), and we then recurse only a constant number of times. Let 
5 po U pi U • • • . 

Connecting the cut pairs in G. Call a pair of vertices {u, v) a cut pair if u ^q v and every directed path 
from utov intersects a path in S. We show how to connect every cut pair by a path of length at most 2. 

Repeat the following for every vertex v G V{G). Let I = {i : v £ V{G^)} and, additionally, for each 
i G /, let Ji = {j : V £ Do the following for each i G / and each j G Jj. Let Pj denote the 

restriction of the paths in P* to G* . Each undirected path in Pj is a concatenation of at most 2 directed 
paths by Lemma E.4. Break up the paths in Pj into dipaths. Consider some dipath P G Pj which visits 
the vertices pi,p2, ■ ■ ■ ,Pm in that order, where m < For simplicity of presentation, assume m 

is a power of 2. For each 1 < z < log2 m, add the following two edges in H: (i) an edge from v to 
Pyi-m/2^ where yi = min^jl < y < 2^ : v ^ Py-m/2^ iri G} and (ii) an edge to v from Py^.m/2'' where 
y2 = maxj^jl < y < 2^ : Py.m/2' ^ v 'mG}.\i any of the sets inside the min or max is empty, do not add 
the respective edge. Finally, add an edge (i;, Pm) iiv ^ pm VaG and (p^, v) lipm^ v in G. Repeat this 
process for every separator dipath that is a subpath of an undirected path in Pj. 

Outer Recursion. For each connected component C of G\S, recurse on the subgraph induced by G. G is 
also path separable since the graph family is minor-closed. 

Lemma E.5. The above construction efficiently produces a 2-TC-spanner on G of size at most 0{n log^ n). 
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Proof Sketch. In each G* , the separators are nicely structured as only a constant number of directed paths. 
Hence, we can add only 0(|y(Gpl log jy(G*)|) edges in order to connect the cut pairs present in each 
G* . Since |y(G*)| < 2n and the number of G"s is 6(1), the total number of edges added in each step 
of the outer recursion is 0(n log n). The size of the remaining connected components halves after each 
graph fragmentation step. So, the outer recursion continues only log n times, making the total number of 
added edges 0{n log^ n). The construction results in a 2-TC-spanner because every pair of related vertices 
(u, v) is a cut pair at some level of the outer recursion. Then, u and v are both contained in some G*. One 
can check that the above construction ensures that both u and v are adjacent to the same vertex on some 
separator dipath intersecting a dipath from uiov. The formal proof is below. 

Proof of Lemma E.5. Let us first see why connecting every cut pair by a path of length at most 2, and 
recursing on smaller components produce a 2-TC-spanner for G. Indeed, if (u, v) is a cut pair, then the first 
step ensures a path of length at most 2 between them. If (u, v) is not a cut pair but there exist some dipaths 
from u to V, then u and v are in the same component C of G\S, and there exists a dipath between them that 
lies entirely within this component. In this case, constructing a 2-TC-spanner on the subgraph induced by 
this C suffices to connect u and u by a path of length at most 2. 

Let us now argue that this process connects every cut pair by a path of length at most 2. Consider 
some cut pair (u, v). Let i be the smallest nonnegative integer such that every dipath from uto v intersects 
a path in [Jii<iPii. Therefore, there must be a dipath from u to v entirely contained in G\ and by claim 
E.3, it follows that there is a j such that both u and v are in Cj. Suppose P G Pj is a separator dipath 
of length m (a power of 2) that intersects a dipath in G* from u to v. Let yi = m.m.y{y : u py} 
and y2 = max^jy : py '^c* yi ^ 2/2 because otherwise there cannot be a vertex on P that lies 
on a path from u to v. One possibility is that yi = y2 = m in which case the construction ensures 
that we add the edges {u,pm) and {pm,v). Otherwise, there exists some z G {1,2, .. . ,log2m} such 
that there is a unique ye {1, 2, . . . , 2^ — 1} for which y ■ m/2^ is in the interval [2/1,2/2]- Moreover, 
u ^ Py^ Py.m/2' ^ Py2 ^ ^- Therefore, the construction above adds the edges {u,Py.^/2^) 

{Py -771/2^, v). 

In connecting the cut pairs in G*-, we add at most 0{s \ V{G'j)\ log |1^(G* )|) because there are at most 
0{s) separator dipaths in G* and for any separator dipath P, each vertex in G* is connected to at most 
2(log2 + 1) < 0(log |V^(G*)|) vertices on the path. Recall that there are only a constant number 

of G^s and I^C^j)! < 2|y(G^)|. Thus, if S{G) denotes the total number of edges in the constructed 
2-TC-spanner for G, we have that: 

S{G)< S{C)+olmaxY,0{\V{G))\-log\V{G))\)\ < ^ 5(G)+0(nlogn) 

Cisac.c. of G\S \ * j / C is a c.c. of C\5 

Since |y(G)| < n/2 for any connected component of G\S, it follows that S{G) = 0(n log^ n). 

If the strong path separators can be found in polynomial time, as is guaranteed, for example, in Theo- 
rem E. 1, then it is clear that the above 2-TC-spanner can be constructed efficiently. □ 

We now prove the part of Theorem E.2 concerning fc-TC-spanners for general k. Again, assume G is 
transitively reduced and connected; now, we wish to construct H, a A;-TC-spanner for G. We perform the 
same preprocessing as before in order to obtain induced subgraphs G^,G^, . . . and a corresponding s-path 
separator S = P^ U P^ U ■ ■ ■ . Define a cut pair (n, v) to be a pair of vertices in G such that u v and 
every directed path from utov intersects a path in 5. This time, our plan is to connect all cut pairs by a path 
of length at most k and then to recurse on each of the connected components that remain after removing the 
vertices in the paths of S. By the argument used earlier, this process produces a /c-TC-spanner. 
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Now we show how to connect cut pairs {u, v) with a path of length at most k. Do the following for 
every vertex v G V{G). Let I = {i : v £ and for each i G /, let Ji = {j : v £ F(G*)}. Do the 

following for each i £ I and for each j £ Ji. Let be the restriction of the paths in to . Break up the 
undirected paths into dipaths, increasing the size of Pj by a factor of at most 2. Do the following for each 
dipath P £ Pj. Let m be the length of P which visits vertices pi,p2, ■ • in that order. Additionally, let 
c{£) be a concave increasing function of £, which satisfies c{£) < £ that we specify later. For simplicity of 
presentation, we omit all floors and ceilings. Let c*{£) denote the smallest z such that c^{i) = 0(1) where 
c^(-) denotes the zth functional power of c. For each z such that 1 < z < c*{m), add the following two 
edges to H: (i) an edge from v to Pyj^.c^^m) where yi = min^yjl <y< m/c^(m) : v Py.c^(m) in G} and 
(ii) an edge to v from Py^.c^^rn) where y2 = max^^jl < y < m/c^{m) : Py.c^(jn) ^ v'va G}. If any of the 
sets inside the min or max is empty, do not add the respective edge. 

Finally, do the following for every dipath P £ Pj that visits vertices pi,p2, ■ ■ ■ ,Pmi^ that order. 



CONNECT-ON(P) 

1. Add to H the edges in a (fc — 2) -spanner of the induced subgraph of the transitive closure of G on 

{Pc( ml ) P2c( Pm}- 

2. Remove the points in the set {Pc(m)^P2c{m)i ■ ■ ■ :Pm} from P and run CONNECT-ON on each con- 
nected component of P that remains. 

This completes our description of H. It is not too hard to see that H is indeed a /c-TC-spanner, using 
reasoning as in the previous section. The only difference is that now, for a cut pair {u, v), it could be that u 
and V are adjacent to different vertices on the separating dipath. But we have the guarantee by CONNECT- ON 
above that two path vertices in the same recursion level of CONNECT-ON have a path of length at most k — 2 
between them. Hence, it follows that u and v have a path of length at most k between them. 

Now, we bound the size of H. First, let us count the number of edges added in each step of the main 
recursion (that is, not counting the edges needed to connect pairs within components of size at most n/2). 
Denote by i{n,k) the quantity Sk{Ln), the size of the optimal /^-TC-spanner for the directed line on n 
vertices. Let us count all the edges added that are incident to some separating dipath P of size m. Denote 
this quantity /(m). By the definition of CONNECT-ON: 

/(m) < 0(n) -|- n/c{m)f{c{m)) + £{m/c{m),k — 2) 

It can be seen that f{m) is minimized when £(m/c(m), A; — 2) = 0(m) = 0(n). For example, for k = A, 
i{n, 2) = 0(n log n) and in this case, c{m) should be chosen to be logm since • log = 0(m). 
In any case, once c{m) is fixed, the solution to the above functional equation turns out to be: f{m) < 
0{n- c* ) . Also, the number of edges added to the vertices not on the separating paths is also 0{n-c* (n)) . 
Making the same arguments as in the analysis of the 2-TC-spanner, we find that the number of edges added 
in total, counting all the paths and all the G*'s is still 0{n ■ c*(n)). Since there are logn levels of the 
recursion at the top level (at each level, the size of the largest component is decreased by a factor of 2), the 
total number of edges added to H is 0{n ■ logn • c*(n)). Finally, we use the results of [5] about optimal 
/c-TC-spanners of the directed line to conclude that c*(n) = 0{Xk{n)). □ 

F NP-Hardness of /c-TC-spanner 

Theorem D.l breaks down for k = 0(log n). For these large values of k we have the following. 
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Theorem F.l. For any k < n for any e > 0, it is NP-hard to approximate the size of the sparsest 
k-TC-spanner within a factor of 1 + -j, for some 7 = (^). 

Proof of Theorem F.l. We use a reduction from 3NodeCover to show that, unless P = NP, k-TC- 
Spanner cannot be approximated within a factor of 1 + 17 (|). That is, for constant k, the problem is 
APX-hard. An instance of 3-Node Cover consists of a collection D of subsets of a universe X. Each subset 
contains at most 3 elements, and each element of X is contained in at most 2 subsets. The goal is to output 
a minimum size subcoUection M C D whose union is X. We need the following result in [1 1]: 

Lemma F.2. SNodeCover is NP-hard to approximate within a factor of 1 + c, for some constant c > 0. 

We now give a reduction from SNodeCover to fc-TC-SPANNER. For a given instance R of 3NODE- 
COVER we construct the following graph G. Let Vi be the set of vertices representing each set d E D. 
Let V2 be the set of vertices representing each element t £ X. Draw a directed edge from each vertex in 
Vi corresponding to d G D to the vertices in V2 corresponding to elements of d. Add an extra vertex a 
and draw directed edges from a to every element of Vi. For each vertex v e Vi add k — 1 new vertices 
vi,V2,---, Vk-1 and connect them via a directed path of length k passing through a,vi,V2, ■ ■ ■ , Vk-i,v in 
the given order. Call this path P{v). 

Let OPTs be the size of a minimum fc-TC-spanner on G and OPT^nc be the size of the solution to the 
initial instance R = {D, X) of SNodeCover. Let \D\ = n. 

Claim E3. OPTg = OPTmc + kn + J2deD l^^l- 

Proof. We show that there is a sparsest spanner H that contains only edges from Vi to V2, from vertex a to 
some vertices in Vi and the edges on the paths P{v), for all v eVi. 

Note that all edges from Vi to V2 need to be included in a sparsest spanner, since each of them forms a 
unique directed path connecting its endpoints. There are J2deD 1^1 ^^^^ edges. Similarly, for each vertex 
V G Vi, all the edges on the path P{v) need to be included in H. In total, there are kn such edges. 

For V G Vi, suppose that H contains an edge {vi, t), for some Vi G P{v) and t e V2. We claim that 
such an edge {vi, t) can be removed and substituted with an edge (a, u), where u e Vi s.t. u is adjacent to 
t. Indeed, all vertices on P{v) except for a are already at distance < k from t in H. Thus, to reach t from a 
it is enough to include in if a directed edge from a to any u G Vi that is adjacent to t. 

Similarly, an edge e between vertices on a path P{v) in H can be replaced by an edge (a, v). Indeed, 
such an edge e can only be useful to connect a to some vertices in V2, via a path that passes through v. 

Therefore, among the edges from a to Fl a sparsest spanner need only contain the edges that connect a 
to a minimum set of vertices in Vi that cover V2. Thus, there are exactly OPTsnc such edges. □ 

Suppose that there exists < 7 and an algorithm A that computes the size of a sparsest /c-TC-spanner 
within 1 + 7. Namely, A outputs s, such that OPTs < s < (1 + 'y)OPTs. We show that 7 > jgq^, where 
c is the constant from Lemma F.2. 

Each set d contained in an optimal solution to R covers at most 3 elements of the universe X. Therefore 
\X\ < SOPT^NC- Any element of X is contained in at most 2 sets of D, and therefore, \X\ > |. This 
implies that n < QOPT^nc- Let s' = s — {k + 2>)n. Then 

OPTING <s' < OPTING + l{OPT^NG + kn + 3n) 

< OPTING + liOPT^NG + 6kOPT3NG + I8OPT3NG) 

= OPT3ivc(l + 7(19 + 6A;)). 
Finally, using F.2, it follows that 7 > jg^^- Thus -/ = nil). 

□ 
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